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ON UNIVALENCE OF A GENERAL INTEGRAL OPERATOR 



AISHA AHMED AMER AND MASLINA DARUS 



Abstract. Problem statement: Wc introduce and study a general integral 
operator defined on the class of normalized analytic functions in the open 
unit disk. This operator is motivated by many researchers. With this operator 
univalcncc conditions for the normalized analytic function in the open unit disk 
are obtained. In deed, the preserving properties of this class arc studied, when 
the integral operator is applied and we present a few conditions of univalency 
for our integral operator. The operator is essential to obtain univalence of 
a certain general integral operator. Approach: In this paper wc discuss 
some extensions of univalent conditions for an integral operator defined by 
our generalized differential operator. Several other results are also considered. 
We will prove in this paper the univalent conditions for this integral operator 
on the class of normalized analytic functions when wc make some restrictions 
about the functions from definitions. Results: Having the integral operator, 
some interesting properties of this class of functions will be obtained. Relevant 
connections of the results, shall be presented in the paper. In fact, various other 
known results are also pointed out. We also find some interesting corollaries on 
the class of normalized analytic functions in the open unit disk. Conclusion: 
Therefore, many interesting results could be obtained and we also derive some 
interesting properties of these classes. Wc conclude this study with some 
suggestions for future research, one direction is to study other classes of analytic 
functions involving our integral operator on the class of normalized analytic 
functions in the open unit disk. 



1. Introduction 

As usual, let U = {z E C : \z\ < 1} be the unit disc in the complex plane and let 
A be the class of functions which are analytic in the unit disk normalized with 
/(0) = /'(0) — 1 = 0. Let S the class of the functions / e A which are univalent 
in U. In particular, for / € A and (z € U, b ^ 0, — 1, —2, —3, ...), A > 0, m <G Z, I > 0, 

the authors (cf., [1, 2]) introduced the following linear operator: 

Definition 1.1. For f e A the operator D™' (a,b)f(z) is defined by D™' (a,b)f(z): 

A — > A and let 



l + l - A z A z 

l + l \-z + 1 + 1(1 -z) 2 '- 



and 



Dl n ' X (a, b)f(z) = <j>(z) * ... * 4>(z) *zF(a, 1; b; z) * /(*), 



(m)— times 



if (m = 0, 1, 2, ...), and 



Key words and phrases. Analytic functions; Univalent functions; Derivative operator; 
Hadamard product; Unit disk; The complex plane. 

2010 AMS Math. Subject Classification. Primary 40A05, 40A25; Secondary 45G05. 
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D?<\a,b)f(z) = <Kz) 



(z)*zF(a,l;b;z)*f(z), 



(— m) — times 

if (m — — 1, —2, ...), thus we have 



D?' X (a,b)f(z):=z + J2 



k=1 



l + A(fc-l) + / ^ m (a) fc _i fc 
1 + / ) (&) fc -i° fc *' 



where f <E A and (zeU,6^0, — 1, —2, —3, ...), A > 0, to € Z, Z > 0. Special cases of 



this operator includes: 



iO,A, 



. ^ m ' (a,&)/(z)-^ A (a,6)/(z)-L( a ,6)/(^), see [11]. 

• TTie Ruscheweyh derivative operator [12] m ifte cases: D ' (/3 + 1, l)/(z) = 
Df>f(z);0>-1. 

• TTie Salagean derivative operator [13]: D™' (l,l)/(z). 

• TTie generalized Salagean derivative operator introduced by Al-Oboudi [14]; 



rC A (i,i)/(*). 

TTie Catas drivative operator [10]: -D[™' (l,l)/(z). 



2. Preliminary Results 

To discuss the univalency of / € A, we have 
Definition 2.1. 

Theorem 2.2. [5] ^Isswrne i/iai / G A satisfies condition 

z 2 f'(z) 



(2.1) 

i/ien / is univalent in U . 



f 2 (z) 



1 



< l, z e U, 



Theorem 2.3. [6] Let a be a complex number, 3?a > and f(z) = z + a2Z 2 + . . 
is a regular function in U. If 



(2.2) 



i-k 

3ta 



■ lH " zf"(z) 

1W 



<1, 



/or aZZ z G Z7, Z/ien /or an?/ complex number /3, 3?/3 > 3?a i/ie function 



(2.3) L>(z) - 

zs regular and univalent in U 



(3 I u 13 - 1 f{u)du 
o 



z + 



Lemma 2.4. (Schwarz Lemma [3]) Lei /(z) the function regular in the disk 
Ur — {z <G C; |z| < T?} , w«i/i |/(z)| < M, M fixed. If f(z) has in z = one zero 
wii/i multiply > to, i/ien 



(2.4) 



l/WI<^N m - zeC/ «> 



i/ie equality (in the inequality (2.4) for z ^ 0) can hold only if /(z) = e -M^ z m , 
where 9 is constant. 
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3. Main Results 

Theorem 3.1. Let g € A,*y be a complex number such that 3^7 > 1, M be a real 
number and M > 1. If 

(3.1) \z(D™' x (a,b)g)'(z)\<M, zeU, 
and 

(3.2) | 7 | < 
then the function 



3^3 
2M' 



(3.3) T 7 (z) 

is in the class S. 

Proof. Let us consider the function 



7 f Z u-r- 1 (e D "^ a ^^y du 



(3.4) 



M 



which is regular in U. The function 



(3.5) 



Hz) 



e DT-\a,b )g( u)y du 



1 zf"(z) 



|7| f'(z) ' 

where the constant I7I satisfies the inequality (3.2), is regular in U. From (3.5) and 
(3.4) it follows that 



(3.6) 



h{z) = -^-z{D^(a,b)g)'(z). 

|7| 



Using (3.6) and (3.1) we have 

(3.7) \h(z)\ < M, 

for all z € U. From (3.6) we obtain h(0) = and applying Schwarz-Lemma we 
obtain 

(3.8) A C^V < M \A, 



f'(z) 



for all z € U, and hence, we obtain 

zf"{z) 



(3.9) 



(1-N 2 ) 



/'(*) 



<| 7 |M|^|(l-|z| 2 ) 



Let us consider the function Q : [0, 1] — ► 9?, Q(x) = x (l — a: 2 ) , x = \z\. We have 

2 



(3.10) 



Q{x)< 



3^3' 



for all x e [0, 1]. From (3.10), (3.9) and (3.2) we obtain 



f'(z) 



<1, 



(3.11) (l-|z| 2 ) 

for all z <= U. From (3.4) we obtain f'(z) = U"''^' 2 )] 1 . Then, from (3.11) 
and Theorem 2.2 for $ta = 1 it follows that the function T 7 is in the class S. This 
is the proof. □ 
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Corollary 3.2. In the special case m = 0,a = b = 1, Theorem 3.1 yields a result 
given earlier by [9] . 

Theorem 3.3. Let D7 1 ' (a,b)g G A, satisfy (2.1), 7 be a complex number with 
K 7 > 1, M be a real number, M > 1 and \j - l\ < (12M « +1)V £ff + i + 36M«-i - # 



D?'\a,b)g{z) 



<M, zeU, 



then the function 

H 7 (z) 

is in the class S. 
Proof. We observe that 

HJz) ■- 



7 / u 





27-2 



7-1 



,DT' X {a,b) g {u)y du 



7/ v?- 1 (ue D T' X ^ b )9My du 



Let us consider the function 

h{ Z ): 



ae D^(a,b) g (u)\-< l ^ 



The function h is regular in U . We obtain 



h"(z) _ z{D^\a,b)g)'{z) + l 

h'{z) l7 j z 



and hence, we have 

(3.12) (l-|z| 2 ) 



zh"{z) 

W) ~ 

for all z E U. From (3.12) we get 



| 7 -1|(1-M 2 ) z(D^(a,b)9)'(z) + l 



(1-N 2 ) 



zh"{z) 



h'(z) 



<|7-1|(1-N| 2 ) 



z 2 {D^ X (a,b)g)\z) 



(D^(a,b)9) 2 (z) 



{D™>\a,b)9?{z) 



for all z G U. By the Schwarz Lemma also 
obtain 

zh"(z 



D™'*{a,b)g{z) < M\z\, z € U and we 



1-N' 



h'(z) 



<|7-1|(1-N| 2 ) 



Jii n m ' A 



^n^MM_! 



\n\.\ 



(D?>*(a,b)g)*(z) 



M 2 \z\+M 2 \z\ + 1 



-,m.A 



for all z ^ U. Since Z^ ' (a, &)<? satisfies the condition (2.1) then, we have 



(i-N 2 ) 



zh"(z) 



h'(z) 



<|7-l|(l-k| 2 )(2M 2 |^| + l), 



for all z€U. Let us consider the function G : [0, 1] -*■ K, G(x) = (l - x 2 ) (2M 2 x + 
1), a; = \z\. We have 
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(12 M 4 + l) Vl2M 4 + l + 36 M 4 - 1 



G(x) < 



54 M 4 



for all x _ [0, 1]. Since | 7 - 1| < {12Mi+1) J^ i+1+36Mi _ v we conclude that 



zti'(z) 



<1, 



(3-13) (1-N 2 ) m 

for all z E U. Note that, by (3.13) and Theorem 3.1 for 3? a = 1 imply that the 

function H 1 is in the class S. This is the proof. □ 

Corollary 3.4. In the special case m — 0, a = 6 = 1, Theorem 3.3 yields a result 
given earlier by [9] . 
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ON EXACT VALUES OF MONOTONIC RANDOM WALKS 
CHARACTERISTICS ON LATTICES 

ALEXANDER P. BUSLAEV AND ALEXANDER G. TATASHEV 



Abstract. A monotonic random walk of particles on a onc-dimcnsional cir- 
cular lattice is considered. Some fixed number of particles moves on a ring, 
which contains a fixed number of cells. Each cell cannot contain more than 
one particle simultaneously. Jumps of particles can be occurred in discrete 
times. This jumps are realized with probability depending on the type of the 
particle and coordinate of the cell occupied by the particle, and the particle 
comes to the cell adjacent to the particle that is ahead, i.e., the length of the 
jump is maximum possible. A Markov chain is considered that describes the 
behavior of the model. The flow intensity and the average velocity of particle 
have been found. 



1. Introduction 

Models that described traffic in terms of cellular automata were introduced by 
K. Nagel et al., [8-10]. In these models particles are moving on a one-dimensional 
infinite lattice or ring one. The time is discrete. Each cell can be occupied by no 
more than one particle. The number of particles on a ring lattice is defined and 
remains to be constant. The models were considered in that the particles move one 
cell forward, the model in that the particles move immediately to the next particles 
going ahead, and also some generalizations of these models. The allowed movements 
are realized with given probabilities. The main investigated characteristics are 
particles flow intensity and average velocity. Simulation and heuristic approaches 
was used for investigation of these models. Some exact results on these themes were 
presented in [6]. Namely, an open chain of cells was considered, and the stationary 
state of the particles flow was investigated. A particle leaves the system when it 
has passed through all the cells. An algorithm has been elaborated for calculations 
of the state probabilities and the average particles velocity. 

M. Blank articles [1-3] contain exact mathematical results for models of de- 
scribed type, where particles move on the infinite one-dimensional lattice. The 
possible transitions of particles arc realized with probability equal to 1. Random- 
ness occurs only in the choice of the initial configuration of particles. 

An exact formula for the average particle velocity of random walks on a ring 
lattice was obtained in [5], where it was supposed that a particle passes one cell 
forward with a fixed probability p, < p < 1 , provided the cell ahead of the particle 
is empty. 

In [4], the limit formula for the average particle velocity is proved for the case 
when the number of particles and the number of cells tends to infinity so that 
the ratio of particles number to cells number tends to a constant. This formula is 
analogous to the formula for the average velocity of particles that was obtained in 
[9] by a heuristic method. 

l 



17 



2 A. P. BUSLAEV AND A.G. TATASHEV 

We consider here a monotonic random walk of particles on a one- dimensional 
circular lattice. Jumps of particles can be occurred in discrete times and are realized 
with probability depending on the type of the particle and coordinate of the cell 
occupied by the particle. A moving particle comes to the cell adjacent to the particle 
that is ahead (model with maximum transitions). The flow intensity and average 
velocity of particle have been found. 

If the number of particles is equal to the number of cells minus one, then con- 
sidered model is identical to the model where particles pass only one cell forward. 
Then the obtained formulas can be regarded as asymptotic formulas for the average 
velocity of particle in the model in that particles pass one cell forward, as the flow 
density tends to maximum value, which is equal to 1. 

2. NONHOMOGENEOUS CHAINS WITH MAXIMUM JUMPS OF PARTICLES 




2nd type 
1st type 

Figure 1 . The model of monotonic random walk 

Let us describe a stochastic model of particles movement on a closed sequence of 
cells, Fig 1. Let the number of cells be equal to n. The particles move at the discrete 
times 1,2,..., in the same direction. The (i + l)-th cell follows the i-th cell in the 
direction of movement, i — 1, ... ,n — 1. The cell 1 follows the cell of number n. 
Each cell is occupied by no more than one particle. There are k types of particles. 

The number of particles is equal to m, 1 < m < n. There are m s particles of the 

fc 
type s, s = 1, . . . , fc; ^ m s — m. The (z + l)-th particle follows the z-th particle 

s=l 

in the direction of movement, i = 1, . . . , m — 1. The particle 1 moves ahead of the 
particle m. Suppose the i-th particle is a particle of the s(i)-th type, i = 1, . . . , m. 
If a particle of the s-th type, s = 1, . . . , fc, occupies the i-th cell and there are d 
empty cells in front of this particle, then the particle passes the maximum number 
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of cells with the probability < pi S < 1, i.e., it comes to the cell adjacent to the 
particle ahead. Hence the order of the particles can not be changed. Configuration 
of the particles at the initial time is fixed. 

Assume that the numbers n and m are coprime. 

The particles behavior can be represented by a Markov chain with discrete time, 
[7]. Each state of chain corresponds to the vector (ii, . . . ,i m ), j-th component of 
which is equal to the index of the cell occupied by the j-th particle, j = 1, . . . , m. 
The number of states is equal to mC™ , where C!" = ,, "• — rr. Suppose the states 

" n : n m\{n—m)\ ^^ 

of the Markov chain are numerated arbitrary. 

A group of particles is called a cluster if there are no free cells between the 
particles of this group. 

If the chain returns to some state with a probability less than 1, then this state 
is called non-recurrent, [7]. 

Suppose some state j exists such that the chain can come to the state j from the 
state i with non-zero probability and cannot return to the state i from the state j. 
It is evident that the state i is non- recurrent. 

Every chain state with more than one cluster is a non-recurrent state. Really, 
the chain can come from such a state to a state with a smaller number of clusters 
and cannot come to any state with a greater number of clusters. 

Let Pi(j) be the probability of that the chain is in the i-th state at the time j, 
i = l,...,mC™ j = l,2,... ' 

It is known from the theory of Markov chains, [7], that for each non-recurrent 
state i 

]imP i (j)=0, (1) 

Thus (1) is true for each state with more than one cluster. 




Pis 



Figure 2. A state with one cluster 

The class of states with one cluster (Fig. 2) contains ran states. The states of 
this class are described by the index of particle moving in front of the cluster and 
the index of cell that is occupied by this particle. Let the state for that the first 
particle of cluster is the j-th particle, occupying the i-th cell, be called the state 
Eij, i = l, ...,n, j = 1, ...,m. 

Suppose that the state Ejj is the chain state at the time k. Then at the time 
k + 1 the chain comes with the probability Pi S (j) to the state E^j* , where i* =i—l, 
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if z = 2, . . . , n, and i* = n, if i = 1; j* = j — 1, if J* = 2, . . . , to, and j* = to, if 
7 = 1. And the state E^ is the chain state at the time k + 1 with the probability 

1-P is (j). 

Lemma 1. Suppose Ei j is some state with one cluster. Let a — 1 be maximum 
possible number of the different states of the chain after passing that the chain can 
return to the state Ei j . The value a does not depend on ig and jo, and equals mn, 
i.e., equals number of states with one cluster. 

Proof. The chain, after leaving a state of the class, returns to this state having 
been at a — 1 transit states, where a is the least common multiple of the numbers 
to and n. Since the numbers n and to are coprime, then a = ran, i.e., a is equal to 
number of the states of the class. Thus Lemma 1 is true. □ 

Let q be the flow intensity, i.e., the average number of particles passing through 
a cell per a time unit: 

_. n m 

q = n J2 J2 P ijPis(j) {n-m). ( 2 ) 

Denote 

r=—,r s = — , s = l,...,k. (3) 

n n 

The value r is the particles flow density. The value r s is the flow component 
composed by particles of the s-th type. 

The flow intensity, the flow density, and the average velocity v are related by the 
formula 

q = rv. (4) 

Theorem 2. For the flow intensity and average velocity of particles, the following 
relations are true 

nr(l — r) 



n k 

EE 



(5) 



Pi 

ls=l 



^^ (6) 



EE£r 

i=l s=l 

Proof. It follows from Lemma 1 that the class of the states with one cluster is the 
unique class of the communicating states, i.e., it is possible to come from each state 
of the class to each other one with a non-zero probability. 

This states are non-periodic also, i.e., the greatest common divisor of the values 
of possible time of the return to the state is equal to 1. Really, at each time the 
chain state can remain the same, and the time of recurrence can be equal to any 
number that is not less than ran. 

According to a theorem of the Markov chains theory, [7] , there exists a non-zero 
steady probability for each state that belongs to the unique class of the communi- 
cating non-periodic states of the Markov chain with a finite number of states, i.e., 
if i is a state of this class, then the limit exists 

P t = lim Pi(j) > 0. 

j-HX 
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Let Pij be the steady probability of the state Eij, i — 1, . . . ,n, j — 1, . . . ,m. 
Steady states satisfy the system of equations 

Pi* s{j*)*i*j* — Pis(j)*ij ) & 1, . . . , 71, J 1, . . . , 771, 

n m 

»=i j=i 

where i* and j* are defined as above. 

The solution of this system of equations is 

p tj = n m lS{j — , i = l,...,n; j = l,...,m. (7) 

Formula (5) follows from (2), (3), and (7). 
Denote 

P = — • ( 8 ) 

i=ls=l 

Taking into account (8), we can rewrite (5) as q = (1 — r)p*. Combining (2), (7), 
and (8), we get (6) and, therefore, 

v = (--l)p*. (9) 

r 

Thus Theorem 2 has been proved. □ 

Corollary 3. If m = 1 and pi S = 1, then 

v= 1 -l. (10) 

r 

Proof Formula (10) follows from (6). 

Formula (10) corresponds to a formula that is given in [3] for a similar model of 
movement on a one-dimensional lattice. □ 

3. Remarks 

Suppose m = n—1. In this case the behavior of the model considered in Section 2, 
is identical to the appropriate models where the particles move no more than one 
cell forward. Hence relations (5) and (6) can be regarded as asymptotic formulas 
for the chain of big density, where particles move no more than one cell forward. 

Suppose that the probability of displacement of particles does not depend on the 
particle type and the index of cell occupied by the particle 

Pis =p, i = l,...,n,s = l,...,k. (11) 

If n and m tend to infinity so that ratio m/n tends to r, then following relation is 

true [4], 

1- Jl - Wl - r) 

Vl = o ' 

2r 

where v\ is the average velocity of particles in the model where the particles move 
no more than one cell forward. 
Combining (6) and (11) we get 

v = v 2 = { l)p. 

r 
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We have lim — = 1. This confirms adequacy of formulas (5) and (6) as as- 
ymptotic for the appropriate models in that particles move no more than one cell 
forward, for flow densities near 1. 

4. Conclusion 

Formulas for average intensity and average velocity are obtained for the model of 
random walks on a ring lattice. The probability of a particle transition depends on 
the particle type and the index of the cell occupied by the particle. The particles 
jump forward to the cell adjacent to ahead of particle. 
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EDEGEWORTH BLACK-SCHOLES OPTION PRICING FORMULA 



ALI YOUSEF 

ABSTRACT. In this paper we consider the Black-Scholes option pricing problem. 
We use Edgeworth second order approximation to approximate the underlying asset's 
return distribution. We state and prove Lemma 3.2 which finds a closed form for the 
solution of the Black-Scholes partial differential equation under Edgeworth 
approximation. The theoretical form of Edgeworth option pricing formula found in 
Lemma 3.2 depends mainly on the skewness and kurtois of the asset's return 
distribution which indeed corrects the classical Black-Scholes formula that was 
presented by Black and Scholes in 1973. Moreover, we find a simple form of delta 
hedging. In the end we find standardized Edgeworth expansions for the following 
distributions; lognormal, student t-distribution and chi-squared distributions. 



1. INTRODUCTION 

The Edgeworth expansion was first introduced by Edgeworth in 1905, see [11], 
as an expansion representing one distribution function in terms of another 
distribution, in such a way that the cumulants of the other distribution function 
should be known. Since its foundation in 1905, it became the center of many 
statistical studies and has been applied in many fields like, economics, finance and 
also in engineering, see [13]. The asymptotic behaviour were developed by [25], and 
its validity region was discuused by [4]. In the following section, we give some 
accounts about Edgeworth second order expansion and its validity region. 



2. EDGEWORTH SECOND ORDER EXPANSION AND ITS 
ASYMPTOTIC PROPERTIES 

Let (X 1 ,...,X n ) be a sequence of IID random varaibles of size n from a 
continuos distribution function F , such that the mean // e 9t , the variance <r 2 < oo , 
the skewness y and the kurtosis f3 are all finite, where, y = a^ E{X - jS) <ooand 
P = a^E(X - /uf <oo. 

Let X n = n l Y t X l , Z„ = -Jn{X n -ju)/(T and F„ (x) = P(Z n < x) . 

1=1 

Then the Central Limit Theorem states that the limiting distribution of Z n , that is, 



Key words and phrases. Black-Scholes formula, Chi-Squared, delta hedging, Edgeworth expansion, 
Lognormal, Student t-distribution. 
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limF n (x) = <D(x):= [(2^) exp(-t 2 /2jdt, VxeSR is a standard normal 

cumulative distribution function, regardless the analytic form of the underlying 
distribution function F. 

If we are interested to find the probability distribution of Z n before reaching the 
CLT then this will leads us to Edgeworth series, which can be derived by expanding 
the logarithm of the characterstic function of Z n using taylor series expansion in 
t and collecting terms of the same order in n and then applying the inverse fourier 
transform to obtain the Edgeworth expansion, see, [18], chapter 4, page 91. Note, we 

need limsup isle"*' ) <1, to ensure the expansion of the characterstic function of 

z„. 

In the following line we state the two term Edgeworth expansion without proof 
as stated in [9], Theorem 13.1 page 186. 

Theorem 2.1. (Two-Term Edgeworth Expansion) Suppose F satisfies the Cramer's 
condition and E F ( X 4 J < oo . Then, 

F n (x) = <D(x) + ^(x)|c 1 p 1 (x)m~" 2 +{c 2 p 2 (x) + c 3 p 3 (x))n~ l \ + 0[n~ 312 ). 

defined for all values of X and as n — > oo . Note that (j> — O is the standard normal 
density function, 

c, =y/6 ,c 2 =(j3-3)/24,c 3 =/ 2 /l2,p l =l-x 2 ,p 2 =3x-x 3 and 

p 3 =-15x + 10x 3 -x 5 . 

Note thatCjand c 2 are called the skewness and kurtosis corrections of the 
underlying distribution function. From Theorem 2.1 and by direct substitutions for 
C l ,C 2 ,C 3 ,p l ,p 2 and p 3 and by taking n — 1 , we have the heuristic result 

(2.1) F(x) = (D(x)- — ^x(x 4 -10x 2 +\5)y 2 --y<j>(x 2 -l) 



_J_«te(**_3)(/*-3) + 0(l) 



24 

If the distribution function F of an absolutely continuous random variable 
admits an Edgeworth expansion, then we can obtain an expansion of the density 

function heuristically by differentiating (2.1Jwith respect to x . Hence the 

probability density function of the Edgeworth expansion is 
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(2.2) f(x) = *(x) 



l + L /x ( x i _ 3 ) + ±(^_3)( x 4 _ 6x 2 +3 J 
+ — X 2 (x 6 -15x 4 +45x 2 -15) 



> + 



0(1) 



Equation (2.2) shows how to represent a continuous probability density function in 

terms of the standardized normal probability density function and which is an 
approximating standardized density with the desired skewness and kurtosis. The 

density function in (2.2) is called a standardized Edgeworth asymptotic expansion; 

see [19]. 

The Edgeworth expansion is more useful in many applications than other 
asymptotic series such as the Gauss-Hermite and Gram-Charlier series, see [6], This 
is so, because, it is directly connected to the moments and cumulants of a probability 
density function, a property that is lost in the Gauss-Hermite series. Secondly, it is a 
true asymptotic expansion since the error of the approximation is controlled by 

estimating the error of the expansion until the order O ( rC in \ . While, the 

disadvantage of the series was shown by [3] that the Edgeworth series can give 
negative values for some values of x . They found the region in the plane of values 
of skewness and kurtosis where the density is positive. This region was further 
studied by [10] in detail using numerical methods. They found that the validity 
region that ensures the Edgeworth series to represent a positive definite and 

unimodal probability density function is R = j I y I < 0.45, 3.0 < /? < 5.35 } . If the 

parameters lie outside the validity region (as we shall see in the last section), the 
results may be misleading. Furthur analytical investigations about the validity region 
were undertaken by [1]. References for the main results on Edgeworth expansions 
are given in [2,4,15]. 

In the next section, we consider the Edgeworth Black-Scholes option pricing 
problem. 

3 . EDGEWORTH OPTION PRICING FORMULA 

In the early 1970's Black and Scholes made an important contribution in the 
pricing of complex financial instruments by developing what so called the Black- 
Scholes model. The model based on a formula that describes the market with two 
financial goods: a risky security, such as a stock(risky asset) and a riskless bond. An 
option is a contract that gives the investor the right to buy or sell the asset or part of 
it without any obligations subject to certain conditions within a specific period of 
time. The price that is paid for the asset when the option is exercised is called the 
strike price K , while the expiration day is the last day on which the option may be 
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exercised. If the option can be dealt only at maturity time t , then we have a 
European-style, otherwise, we have an American style. 

The most important application of Ito's lemma in the area of financial 
mathematics is option pricing and the most interesting result in option pricing is the 
Black-Scholes formula. To simply our discussion assume that the model that can be 
used to represent the random price S t of a risky asset follows a geometric Brownian 
motion, 

(3.1) dS t =MS t dt + o-S,dW t 

Where, \ S t , t e S JT j is a stochastic processes, /J is the average rate of the asset price 
growth, judt is the drift mean, a is the volatility, adW t is the random change in the 
asset price and W is a Wiener process. 

Equation (3.1) is called a stochastic differential equation and states that the return of 

in 

the asset — - consists mainly of two main components; a constant return /udt and a 
random return a dW t . 

By applying Ito's Lemma, see [24], Lemma 3.2,page 96 to (3.1) we obtain the 
Black-Scholes partial differential equation for a European call option price C t . 

i~ „\ 9C, 1 ,„ 7 3 2 C, DC, „ 

3.2) — '- + -a 2 Sf f + rs — L = rC, 

v ' dt 2 dSf DS, 

Equation (3.2) is called a parabolic partial differential equation, and r is the risk 
free rate. 

It was shown from [22] that the payoff of the European call option is given by 
C, =max{5', -K,0}, where S t is the price of an asset at maturity time t and K is 
the strike price. 

Now, since S t satisfies, (3.1) then we can use the substitution Y t = log (5, ) and 
Ito's Lemma to obtain 

which yield the following solution 



S, =5 exp \(n-<j 2 l2)ds + \<jdW 
VO o J 
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(3.3) S, =S Q exp(( J Li-<j 2 /2)t + <jW,)yt£ i >R + ,W, DN(0,t) 

Note (3.3)is the exact solution of (3.1) and log(S,)D N^ju-a 2 /l)t, a 2 t\. 

Under Q (risk neutral probability measure) , the average price of the asset should 
satisfy the following equation 

(3.4) E(S,) = S exp(rt) 
Hence from Q , the value of the call option at time zero is 

C =e X p(-rf)£ G [(5,-^) + J. 

The solution of (3.2) yields the celebrated Black-Scholes formula for the European 

call option at time zero with a non-dividend paying stock and with strike 
price K and maturity time t 

(3.5) Cf =S <S(d 1 )-Kexp(-rt)(b(d 2 ), 

ln(S /K) + (r + a 2 /2)t 

rf, = '■= ,d 2 =d 1 -cryf. 

o4t 

Where S is the current asset's price at time zero, a is the volatility of the relative 

price change of underlying asset price, r is the short term risk free interest rate, K is 
the strike price and t is the maturity time of the option. Note, all the above 
parameters can be obtained from the market data except a . For more details see, 
[5]. Volatility is a measure of riskiness of an asset, the more volatile it is, the more 
risky it is. Technically, it measures the standard deviation of short term returns on 
the asset, which can be estimated either from the historical price path of the asset, or 
using discrete method. For more details about volatility see, [7]. 

The disadvantages of (3.5) lies in two main points; first it assumes that the 

volatility of an asset remains constant all the period time, irrespective of the 
direction of price movements. Second, it assumes that the distributional form of the 
asset returns should be normally distributed, which is inconsistent with the observed 
data in the market. It was shown by many statisticians and observers that the 
distribution of assets return follows a non-normal distribution, presenting heavy tails 
and asymmetry. For more details, see [14, 21]. The point lies that the normality 
assumption of the Black-Scholes model does not capture extreme movements such 
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as stock market crashes. To resolve this problem several series approximation 
techniques were used: [8], used Gram-Charlier expansion to approximate the 
return's asset distribution, while [12] used Edgeworth approximation. [17] used the 
generalized Edgeworth expansion of the lognormal distribution of asset prices to 
obtain a model that adjust the Black-Scholes formula for the higher moments in the 
underlying distribution. 

In this paper, we will find a closed form for the theoretical European call option 
price assuming the return's asset distribution is an absolutely continuos distribution 
function, provided the first six moments exist. Our technique based on using 
Edgeworth second order approximation which has a bounded error. 

To proceed our solution we need the following Remarks which will enhanced our 
main result in Lemma 3.2. 

Remark 3.1 

Let / , (y)be a standardized Edgeworth probability density function defined as in 
(2.2J and let (j) and <Dbe respectively, the probability density and distribution 
function of TV (0, l) , then 

\f{y)dy = ®{z)+<l>{z)Q{y,p,z), 

e(r,A Z ) = {^( Z 4 -i0z 2 + i5) + ^(z 2 -i) + ^ z (z 2 -3)}. 

Remark 3.1 follows immediately by expanding the integral over the terms and using 
the following identities, 

cc cc cc co 

j<f>(x)dx = ®(z), $x0(x)dx = <p(z),$x 2 <p(x)dx = <t>(z) + z<p(z), $ x 3 <p(x)dx = <p(z)(z 2 +2), 

cc co 

\x*<f>(x)dx = m(z) + z(t>(z){z 2 +?) and J x 6 <p(x)dx = 15®(z) + z<p(z)(z 4 +5z 2 +15). 



The above integrals can be found easily from integration by part and the properites 
of error functions. Note <I>(z) = !-<!>( z). 
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Remark 3.2 

Under the same condition of Remark 3.1, we have 

CO 

$exv(by)f(y)dy = exp(b 2 /2)®(-z + b)f(r,/3) + 0(z)exp(bz)g(r,j3,b,z). 

where, 

T t = z 5 +bz 4 +(b 2 -l0)z 3 +b(b 2 -6)z 2 + (b A -3b 2 +\5)z + b(b A -b 2 +3), 
T 2 =(z 2 +bz + b 2 -l) and T 3 = (z 3 +bz 2 +(b 2 -3)z + b 3 -b). 

Remark 3.2 can be verified by using fheidentitiy exp [by) <fi(y) dy = expib 2 /2J<t> (p - z] 

z 

Which can be proved as follows 

\w${by)<t>{y) dy = -p==jexp(by-y 2 /2) dy = -= Jexpl --[y 2 -2by) dy 



= ^]^p(-^(y-bf+b 2 /2Jdy=^ & xp(b 2 /2)] & xp(-^(y-b) 2 ]dy. 

1 °° ( 1 ^ 

Let u = y-b^> du = dy^> exp(b 2 /2) [ exp — u 2 du 



= -^exp(b 2 /2)<D(z-b) = -^exp(b 2 /2)<D(b-z). 

42tc y2;r 

and using the following identities; 

co 

j* (u+b)<f>(u)du = <f>(b- z) + (b/2)(l+2®(b- z)), 

z-b 

GO 

j" (u+b) 2 <j>{u)du = cf>(b- z)(z + b) + (l/2){b 2 +l)(l + 20(b- z)), 

z-b 

GO 

j" (u+bf <f>{u)du = <f>{b- z)(z 2 + zb + b 2 +2) + (l/2)b{b 2 +3)(l + 20(b- z)), 
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\U +b) (f>(u)du - 

z-b 

<t>{p- z)[z + z 2 b + z(b 2 +3) + b 3 + 5b) + (l/2)(b 4 + 6b 2 + 3)(l + 20>(/?- z)) 
and 

00 

(u + b) <f>{u}du - 

z-b 

<f>(b- z)(z 5 + z 4 b + z 3 (b 2 +5) + z 2 (9b + b 3 ) + z(b 4 +12b 2 +15) + b 5 +14b 3 +33b) 
+ (l/2)(b 6 +15b 4 +45b 2 +15)(l + 2<£>(b- z)). 

The risk free rate under the classical Black-Scholes formula (underlying asset's 
return distribution is standard normal) is /u = r , wheras under the Edgeworth series 
will be derived in the following Lemma 3.1. 

Lemma 3.1 

Under Q ,the value of the risk free rate that satisfies (3.4) is, 

rt = jut + In / (y, P,a,t) where, 

f^,^a,t) = \^ Y 2 (a4t)\\ Y (cy4t)\h l 3-3)(a4t) A +1 



J2 \ 1 6 v ' 24 
Proof 

E(S l )=limfS sxp[( / u-a 2 /2)t + a^ty)f(y)dy 
= 5 exp((//- 0-72)?) Urn J &vp\a<Jty}f(y)dy 

Recalling Remark 3.2 , we obtain the following 

* w -w*-^w to ^ K '', + * )/0 ' , ' ) ] 

V ' z ~° {+ct>{z)exv{bz)g{y,p,b,z) \ 

= S exp((jLi-a 2 /2)t)limexp(b 2 /2)®(-z + b)f(y,j3), 

+ S expH > u-cr 2 /2jn lim 0(z)exp(bz)g(y,fi,b,z), 
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£(5,)=5 () /exp((//-c7 2 /2)f)exp(^72) lim 0(-z + b) 
+ 5 exp((//-o- 2 /2)f)lim^(z)exp(fc)^(z), 

Since lim ®(-z + 6) = l, then 

E(S,) = SJexp(( M -a 2 /2)t)exp(b 2 /2) 

+ 5 exp((//-o-72)f)lim^(z)exp(fe)^(z), 

Let b = a4t, then 

£(5j = 5 /exp(//?) + 5 exp((//-cr72)r)_Um^(z)exp(o-^z)g (z) 

By completing the square we obtain 

E(S,) = S f exp(jut) + S &xp{fM ) lim </>\-z + o^/f Jg (z) 

But, hm ^(-z + cr%/f ] z" = 0, for all t > and ueD + 

Which implies that £'(5', ) = S„f exp(//f ). 

But we need, 

E(S, ) = S exp(rf) => 5 exp(jut)f(y,J3) = S exp(rt) 

By taking the natural logaritms for both sides we obtain 

rt = jut + \nf(y,p). 

Proof is completed. 

As a result from Lemma 3.1 , (3.3) reduced to 



(3.6) S, =- 5 -exp((r-cr72)f + oW,) ,W,UN(0,t) 



Now we are ready to state and prove the main result in Lemma 3.2. 
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Lemma 3.2 

Let Q be the risk-neutral probability measure, and let S and C be respectively the 
asset value and a European call option at time zero, with strike price K and maturity 
time t , and let r be the risk free rate that satisfies Lemma 3.1, then, for b - o4t , 
the Edgeworth Black-Scholes option pricing formula formula is 



Cf s =S o <t>(d)-Ke- r '(t>(d-b) + S o 0(d)^—Kexp(-rt)0(d-b)Q(y,/3,d) 



ln(S /Kf) + (r + a 2 /2)t 
where, d = , z = -(d-b) 



and 

f(y,/3) = ^y 2 b b + ^yb i + ±(/3-3y + l 

Q(y,/3,d) = ^y 2 z{z 4 -Wz 2 + 15) + ^y(z 2 -l) + ^(/3-3)z{z 2 -3) 
g(y,fJ,d) = l±T ir 2 + ^T 2 y + ±(fJ-3)T 3 



While Zj , T 2 and T 3 are respectively, 

7, = z 5 +bz 4 +{b 2 -10)z 3 +b[b 2 -6)z 2 +(b 4 -3b 2 +\5)z + b[b A -b 2 +3). 
T 2 =(z 2 +bz+b 2 -l),r 3 =(z 3 +bz 2 +b 2 z-3z+b 3 -b). 

Proof 

Under the risk neutral measure, the value of the European call option at time zero is 

C () =ex P (-rt)£ e [(S,-/0 + J. 

From (3.6), we have 



S, =^exp((r-<r72)f + crVK,), W, D N(0,t) 
Let Y = W t /-Jt ,then 
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S, =- 5 -exp((r-<T 2 /2)f + <T^F),FD7V(0,l) 

The value of the European call option at time zero is max (5, —K,0). Then 

C =exp(-rf)£ , {max(S, -K,0)} 

= exp(-rt)E\max\-?-exp((r-a 2 /2)t + a*JtY}-K,0 

= exp(-rf)J = °max \ -^-exp([r-a 2 /2)t + a^ftY)-K,0 \f(y)dy 
Where f(y) given by (2.2) . 



Now, max — -exp((r-a" 2 /2jr + aV?F)-,R",0 occurs when 



r< _ 

-^exp((r-cr 2 /2)t + <j'JtY}>K<^Y>z=(ln(Kf/S a )-(r-<j 2 /2)ty<j$ 

which implies that 

C =exp(-rt)[^xp[(r-a 2 /2)t+a^tY)-K ) jf(y)dy 

= ^exp((-cT 2 /2)t)j\xp(a^tY)f{y)dy-Kexp{-rt)jy{y)dy 

By recalling Remark 3.1 and Remark 3.2 and taking b = cryf we have 



C = ^-exp((-a 2 /2)t)j\xp(a^Y)f(y)dy-K e xp(-rt)^f(y)dy 

= S ®(-z + b) + ^<f>(-z + b)g-Kexp(-rt)®(z)-Kexp(-rt)<f>(z)Q 
= S <S>(-z + b)-Kexp(-rt)Q>(z) + —<l>(-z + b)g-Kexp(-rt)<t>(z)Q 



Since ®(z) = l-<I>(z) = 0(-z) , then 
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( S„ 



C = (S <3>(-z + b)-Kexp(-rt)<3>(-z)) + -^0(-z + b)g-Kexp(-rt)0(z)Q 

v / 



But, 



-z + 



ln(Kf/S )-(r-a 2 /2)t 

crjt = p — + a^t 

a«Jt 



r + °"V f = r 



cr-jt 



then, -z = d- a --ft. 



Hence, 



C = (s o <S>(d)- Kexp(-rt)<S>(d -a-Jt}) + —<j>(d) g - K exp(-rt)^{d -cr-Jt^Q 



f 



Proof completed. 

Lemma 3.2 states that the Edgeworth European call option price is the same as the 
classical option price plus a significant term that depends mainly on the standardized 
skewness and kurtosis of the underlying asset's return distribution. This term is 



(3.7) S <f>(d)--Kexp(-rt)<f>(d-(T'Ji)Q(r,P,d) 



Clearly, under the normal distribution, 



d,- = 0and<2 = 

f 



0^>C, 



Edg 



c" 



Thus (3.7) measures the departure amount from normality assumption. Also, note 
that, Lemma 3.4 provides a solution of the Black-Scholes equation under the 
Edgeworth series. Empirical study could be used to test our formula in Lemma 3.2. 
[23] tested the model suggested by [8], to price call options on S&P CNX Nifty. 
Their results strongly suggested that including the skewness and kurtosis terms to 
the Black-Scholes option pricing formula yield values much closer to market prices. 
They confirm by their study that fitting of higher order moments of the distribution 
of returns would indeed improve the approximation of the call option, especially for 
options away from the money. This suggest that using our formula would indeed be 
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more efficient than other formulas. Before, we end up this section, it would be worth 
mentioning the delta hedging which controlled the risk occurs in option pricing, 

Corollary 3.1 

Under the conditions of Lemma 3.2, the delta hedging result from applying 
Edgeworth option pricing formula is 

where, 

^ = j± ri y + Ir 2 v + -L(^-3)r; 

dS [72 ' 6 ' 24 v ! 3 

?Q = \^ r i( z >- 6z i + 3) + l - r ( z ) + l -(j3-3){z 2 -l)i 7r^--^,*=l,2,3. 
8S [72 l ; 3 V J 8 V A ') ' bS dz 

Proof 

From [16], the delta hedging is 

dS ° v ' 

Thus, 
A = O(rf) + AJs ^(rf)^-A:exp(-rt)^(d-6)e 

by using the chain rule differentiation technique and using the idenities 
where, 

sr. 

r. =-bS— L ,i = 1,2,3. 

the proof is complete. 

Now, to find the estimated values of rt%s ,g and b from the market data we 
proceed as follows; 
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Let SpK ,S n be the daily closing prices, and let R = ln(5 ;+1 )- In (5,.) for all 
i= 1,2,.. .,n , then 



&= R= n 'a ^ >$ = %/252^ where, S R = (n- 1) " 2 ja (/?,-- /?)' 



« n " gfc- #£ , 

« = a g ± anc i 

(n- l)(n- 2), = 1 § 5 S g 

»6i+l) " gfc- tfg 3(n- l) 2 

M= - — - — - — -a *- 



(n- l)(w- 2)(m- 3), = 1 | S R (n- 2)(n- 3) 
be executed directly from Excell sheet. 



All the above formulas can 



Example 3.1 

We considered the data sheet found in Hull website that represents the stock prices 
for S&P during the period from 18 th of July 2005 till 13 th of August 2010. The 
descriptive statistics for Returns are illustrated below 





Mean 


Std 


Min 


Max 


SK 


Kurt 


Returns 


0.000024 


0.015531 


-0.09 


0.1158 


0.027903 


9.250233 



Table 3.1. Descriptive statistics for S&P Returns 



Histogram for Returns 

Normal 




-0.09 -0.06 -0.03 0.00 0.03 0.06 0.09 
Returns 



Mean 


0.00002401 


StDev 


0.01553 


N 


1278 



Figure 3.1. Returns for S&P prices from 18/07/2005 till 13/08/2010 
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Probability Plot of Returns 

Normal 



am 

Tf"f""j/ 



Mean 


0.00002401 


StDev 


0.01553 


N 


1278 


AD 


39.293 


P-Value 


<0.005 



0.00 0.05 

Returns 



Figure 3.2. Testing normality of return's underlying distribution using Anderson- 
Darling Test. 

Figure 3.1 and Figure 3.2 above assert that the underlying asset return's distribution 
do not follow a normal distribution with P-value less than 0.005, using Anderson- 
Darling test. To compare our formula with the classical Black-Scholes formula 
presented in 1973 and with the modified one that was presented by [8] we consider 
the following data: 

Data: S = $1200, K = $1200, r = %5, a = 0.24655 years, t = 5.07 years 





Black-Scholes 


Gram-Charlier 


Edgeworth 


Call Option 


$390,486 


$374,255 


$350.85 


Delta 


0.769 


0.795866 


0.73791 


d 


0.7342089 


0.7342089 


0.6881122 



Table 3.2. European call option and Delta for each case 

Table 3.2 shows that our formula adjusted the European call option price rather than 
the previous ones, moreover the value of the delta hedging is less than others. These 
results indicate that using Edgeworth option pricing formula would be much better 
for detecting the high oscillation of market prices. 

In the last section, it would be suitable to find standardized Edgeworth expansions 
for some densities that are widely used in finance; these densities are lognormal, 
student's t distribution, and chi-squared distributions. 
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4. EDGEWORTH APPROXIMATION FOR STATISTICAL 
DISTRIBUTIONS 

I. Lognormal distribution 

A random variable X has a lognormal distribution with parameters fj. and a , if 
ln(X) D N(jU,<j 2 ) . The probability density function of the lognormal distribution 



f x (x; ju, <t) = \2n:x 2 a 2 J exp 



2~l a 



7(x>0), 



It is mainly used to describe the distribution of asset prices over a specific period of 
time. The skewness and kurtosis of X are respectively 



7- 



Y + i)i 



e a -land /? = e 4CT +2e icT ~ + 3e 2CT -3. 



From (2.2j the Edgeworth asymptotic expansion for the standardized lognormal 
distribution is 



f(x) = j(x) 



1 + 6 1 (t7 + 2)^77 2 -1x(x 2 -3) + 24 _1 (77" +2t/ 3 + 3^ 2 -6)(x 4 -6x 2 +3) 
+72" 1 (77 + 2) 2 (77 - 1) (x 6 - 15x 4 + 45 x 2 - 15) 



K 



0(1) 



77 = e 



Clearly, the Edgeworth series depends mainly on the variance of the underlying 
distribution. 

Table 4. 1 below shows that both the values of skewness and kurtosis increases as the 
value of standard deviation increases. 
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Standard deviation 


Skewness 


kurtosis 


0.01 


0.0300017501 


3.00160023 


0.05 


0.1502190453 


3.04014412 


0.10 


0.3017590981 


3.16232386 


0.50 


1.7501896560 


8.89844568 


1.00 


6.1848771360 


113.9363922 



Table 4.1. Skewness and kurtosis values associated for every a under the 
lognormal distribution 

Figure 4. 1 below exhibits the behavior of the standardized Edgeworth density as the 
values of standard deviation increases. 



/ 




r 






X 

Figure (a), <T=0.01 




X 

Figure (b), <T=0.1 
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Figure 4.1: The standardized Edgeworth approximation under the lognormal at 

different values of a 



II. Student's t distribution 

The probability density function of the T □ t (v) is 



fr(t)- 



K„ 



1 + — , te% 



v 



v 



Where, k v = <JvB(l/2,v/2) . Note B(a,b)= K ' K ' . 

From (2.2) , the Edgeworth asymptotic expansion for the standardized t (v ) is 



f (x) = (^(x)jl+ -4 (x 4 -6x 2 +3)1 + 0(1) , for all v > 4. 



The standardized density for T is 
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•(*): 



V + l 



H v ~ 2 ) 



..2 Y 



1 + - 



v-2 



XG% v>2. 



The Edgeworth expansion for t ( v j depends on v and clearly, 
]imf(x) = &(x) + 0(l). 

To illustrate further the role of v , consider v =5, 10 and 20. 

Figure 4.2 below shows the poor performance of the Edgeworth series for the 
standardized density of the t distribution with v —5. Here the kurtosis lies outside 
the validity region. 



0.7 
0.6- 




Variable 
-T(5) 
Exac5 



Figure 4.2. The standardized? (5j density and its Edgeworth approximation 

Figure 4.3 below shows the good performance of the Edgeworth approximation for 
the standardized density at v =10. Here the skewness and the kurtosis lie inside the 
validity region. 
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Variable 
- T(10) 
ExaclO 



-3 -2 -1 



Figure 4.3. The standardized t (10) density and its Edgeworth approximation 

Similarly, Figure 4.4 below shows the good performance of the Edgeworth 
approximation to the standardized density at v = 20. Both the skewness and the 
kurtosis lie inside the validity region. 




Variable 
- T(20) 
Exac20 



-3 -2 -1 



Figure 4.4. The standardized t (20 J density and its Edgeworth approximation 

III. Chi-Squared distribution 

The Edgeworth asymptotic expansion for the standardized chi-square distribution 
with r degrees of freedom is 

'w-*» rfVi* * — r 0(l) - 
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The Edgeworth expansion of the chi-square distribution depends on the degrees of 
freedom. Note also that lim/(i) = <f>(x) + 0(\) . 

To illustrate the effect of increasing the degrees of freedom r on the accuracy of the 
Edgeworth approximation, we take r = 2, 5, 10 and 50. 

Clearly, at r = 2 yields the case of an exponential distribution with mean two, and 
the Edgeworth expansion for this case is 



f( x ) = ( f,( x )\± x o -l x * ^x 3 +x 2 -x + — 1 + 0(1). 
J {18 12 3 12 J W 



Figure 4.5 below shows poor performance of the standardized Edgeworth density 
under the exponential distribution due to the sharp values of both the skewness and 
kurtosis, where both the skewness and kurtosis, y = 2, /3 = 9, are outside the validity 
region. 




Figure (a), r = 2 




Figure (b), r = 5 



Figure 4.5. The standardized Edgeworth approximation under the chi-squared with 

r =2 and 5. 
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At r =5, this case leads to a better performance than r = 2 . Note that y = 1.265, 
/3 — 5.A both of which are outside the validity region. However, the value of the 
kurtosis is close to the upper bound specified in the validity region. 

Figure 4.6 (a) shows the performance at r = 10, where y= 0.8943 and /?=4.2. 

This is a nice case where the skewness is outside the validity region but the kurtosis 
is inside the validity region. Nevertheless the skewness is not large. Figure 4.6 (b) 
shows excellent performance, since both the skewness and kurtosis are inside the 
validity region. 



A 










X 

Figure (a), r = 10 




Figure (b), r =50 



Figure 4.6. The standardized Edgeworth approximation with 10 and 50 degrees of 

freedom 

As a conclusion, the Edgeworth approximations improve as r increases. 

CONCLUSIONS 

From the results of previous sections, we deduce that Edgeworth 
distribution is easily handelled and yields good approximation under its validity 
region. We find the Edgeworth option pricing formula which indeed enhances the 
theory of black-Scholes model. For further studies, one could test our formula using 
empirical data. 
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Abstract 

Here we obtain a wide range of vectorial integral operator general 
incqualitites on time scales using convexity. Our treatment is combined 
by using the diamond-alpha integral. When that fails in the fractional 
setting we employ the delta and nabla integrals. We give many interesting 
applications. 

2010 Mathematics Subject Classification: 26B25, 26D15, 26A33, 39A12, 
93C70. 

Key words and phrases: time scales, diamond alpha integral, time scales 
fractional Riemann-Liouville integral, time scales fractional derivative, vectorial 
time scales integral operator. 



1 Background 

We start with the definition of the Riemann-Liouville fractional integrals, see 
[22]. Let [a, b], (— oo < a < b < oo) be a finite interval on the real axis K. The 
Riemann-Liouville fractional integrals I" + f and I"_f of order a > are defined 

by 

(^ + f)(x) = f ^jy(t)(x-t) a - 1 dt, (x>a), (1) 

(I^f){x)^ f ^ ) jj{t){t-x) a - l dt, (x<b), (2) 

respectively. Here L (a) is the Gamma function. These integrals are called the 
left-sided and the right-sided fractional integrals. We mention a basic property 
of the operators /"+/ and I"_f of order a > 0, see also [26]. The result says 
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that the fractional integral operators I" + f and I"_f are bounded in L p (a,b), 
1 < p < co, that is 



where 



||4V|| P <*II/II P , \\i?-f\\ p <K\\f\\ p (3) 

K= {b -" r . (4) 

ar(a) V ' 

Inequality (3), that is the result involving the left-sided fractional integral, was 
proved by H. G. Hardy in one of his first papers, see [18]. He did not write down 
the constant, but the calculation of the constant was hidden inside his proof. 

So we are motivated by (3), and also [5], [7], [8], [21], [2], and we will prove 
analogous properties on Time Scales. But first we need some background on 
Time Scales, see also [12]. 

A time scale T is an arbitrary nonempty closed subset of the real numbers. 
The time scales calculus was initiated by S. Hilger in his PhD thesis in order 
to unify discrete and continuous analysis [19, 20]. Let T be a time scale with 
the topology that it inherits from the real numbers. For ( £ T, we define the 
forward jump operator a : T — > T by 

a{t) =inf{seT: s>t}, (5) 

and the backward jump operator p : T — > T by 

p(t) =sup{seT : s < t}. (6) 

If a(t) > t we say that t is right-scattered, while if p(i) <t we say that t is left- 
scattered. Points that are simultaneously right-scattered and left-scattered are 
said to be isolated. If a (t) — t, then t is called right-dense; if p (t) — t, then t is 
called left-dense. The mappings p, v : T — ► [0, +oo) defined by p (t) := a (t) — t 
and v (i) := t— p (t) are called, respectively, the forward and backward graininess 
function. 

Given a time scale T, we introduce the sets T k , Tj,, and T| as follows. If 
T has a left-scattered maximum t\, then T k = T — {ii}, otherwise T fe = T. If 
T has a right-scattered minimum £2, then T^ = T — {£2}, otherwise T^ = T. 
Finally, T| =T fe nT fc . 

Let / : T — > R be a real valued function on a time scale T. Then, for t € T fe , 
we define / A (£) to be the number, if one exists, such that for all e > 0, there is 
a neighborhood U of t such that for all s <G U, 

\f (a(t)) f (s) f A (t)(a(t)- s)\ <e\a(t) s\ . (7) 

We say that / is delta differentiable on T fc provided / A (t) exists for all t G T fe . 
Similarly, for t G T^ we define / v (t) to be the number, if one exists, such that 
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for all e > 0, there is a neighborhood V of t such that for all s € V 

\f(p(t))-f( S )-r(t)(p(t)-s)\<e\p(t)-s\. (8) 

We say that / is nabla differcntiable on ¥&, provided that / v (£) exists for all 
ieT fe . 

For / : T ->■ R we define the function /"• : T -*■ R by /^ (t) = / (a (*)) for 
all i e T, that is / CT = / o er. Similarly, we define the function / p : T — > R by 
fp (t) = f(p (*)) for all t € T, that is, /' = / o p. 

A function / : T — > R is called rd-continuous, provided it is continuous at all 
right-dense points in T and its left-sided limits finite at all left-dense points in 
T. A function / : T — > R is called Id-continuous, provided it is continuous at all 
left-dense points in T and its right-sided limits finite at all right-dense points in 
T. 

A function F : T — > R is called a delta antiderivative of / : T — > R provided 
that F A (t) = / (t) holds for all t e T k . Then the delta integral of / is defined 

by 

" f(t)At = F(b)-F(a). (9) 



A function G : T — > R is called a nabla antiderivative of g : T — > R, provided 
G v (t) = 5 (t) holds for all t e Tj.. Then the nabla integral of g is defined by 
J g(i)Wt = G (b) — G (a). For more details on time scales one can see [1, 12, 13]. 

Now we describe the diamond-a derivative and integral, referring the reader 
to [23, 25, 27, 28, 29, 30] for more on this calculus. 

Let T be a time scale and / differentiable on T in the A and V senses. For 
t € T* we define the diamond-a dynamic derivative f^ (t) by 

fO a (t) = af A (t) + (1 - a) f (t) , 0<a<l. (10) 

Thus, / is diamond-a differentiable if and only if / is A and V differentiable. 
The diamond-a derivative reduces to the standard A derivative for a = 1, or 
the standard V derivative for a = 0. Also, it gives a "weighted derivative" for 
a € (0, 1). Diamond-a derivatives have shown in computational experiments to 
provided efficient and balanced approximation formulae, leading to the design 
of more reliable numerical methods [27, 28] . 

Let /, g : T — > R be diamond-a differcntiable at t <G T|. Then, 
(i) / ± g : T — > R is diamond-a differentiable at t £ T^ with 

(f±gf a (t) = (f) <>a (t)±(gf a (t). (11) 

(ii) For any constant c, cf : T — > R is diamond-a differcntiable at t € T^; 
with 

(cff a (t) = c(ff(t). (12) 
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(iii) fg : T — > M. is diamond-a diffcrcntiable at t <G T^ with 
{fgf (t) = iff (t) g (t) + ar (t) g A (t) + (l-a) F (t) g v (t) . (13) 

Let a, t G T, and h : T — > R. Then, the diamond-a integral from a to t of h 
is defined by 

t pt pt 

h(r)<} a T = a h (r) At + (1 - a) / /i(t)Vt, < a < 1. (14) 

*/ a J a 

We may notice the absence of an anti-derivative for the <C> Q combined derivative. 
For t 6 Ifc, in general 



/l(r)O a r ^ /»(*). 



(15) 



Although the fundamental theorem of calculus does not hold for the ^-integral, 
other properties hold true. Let a,i),teT,ceR. Then, 



(i) 



{/(r)± ff (r)}O a r= / f{r)<) a T± f <? (r) Q r; 



(16) 



(") 



t r-t 

cf{r)O a T = C /(t)Oar; 



(iii) / /(r)Oar= / /(r)^r + / /(r)OaT; 



(iv) If / (*) > for all £, then / / (£) Q i > 0; 

J a 

(v) If f(t)<g (t) for all t, then / / (t) a t < j g (t) a t; 



(vi) If / (t) > for all t, then / (i) = if and only if / / (£) <} a t = 0; 

rb 



(vii) 



c<C>«i = c(b — a) : 



(viii) 



We mention 



/(*)0«* 



< / |/(t)|0at. 



Theorem 1 (multivariate Jensen inequality, see also [15, p. 76], [24]) Let 
f be a convex function defined on an open and convex subset C C M. n , and 
let X = (Xi, ...,X n ) be a random vector such that Probability (X G C) = 1. 
Assume also E(\X\), E (\f (X)\) < oo, E stands for the expectation. Then 
EX e C, and 

f(EX)<Ef(X). (17) 
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We would use Jense's diamond-a inequality 

Theorem 2 ([14], Jensen's inequality with multiple variables) . Tet T a time 
scale, S C R n open and convex, a, b G T and /:S-tlfl continuous convex 
function. Let h, gi, ..., g n G C (T, R) such that J \h (t)\ (} a t > and g\ ([a, b]) x 
... x g n ([a, 6]) C S. Then 

f ( jl\h(t)\gi(t)Q a t QhiMgnmat ] < />(f)l/(gi(*),---,gn(f))<>at 



/>(*)! 0a« 



/>(*)! 0«t 



/>(*)! 0a* 



(18) 



Remark 3 ij 5j/ [9] and [17] we conclude that the multivariate Jensen's in- 
equality is valid for the delta-A and nabla-V Lebesgue integrable functions and 
measures, respectively. 

ii) Let $ : R™ — ► R, we call it increasing per coordinate, iff whenever Xi < yi, 
i = l,...,n; x i ,y i eR + , then $ (x 1; ..., x n ) < $ (y lt ..., y n ) . 

In [5], we proved that if $ : R™ — > R is convex and increasing per coordinate, 
then $ is continuous. 

We can extend $ to $ : R™ — ► R and stiW 6e convex, by defining 



$(xi,...,Xj,...,x n ) ■■= $(xi,...,-Xj,...,X„), 



(19) 



for any Xj < 0, j e {1, ...,n}, see Lemma 10. 

Hence we can apply Jensen's inequality using $ on R n . /£ is we/Z known, 
that a convex function on an open and convex subset o/R™ is continuous. 

We further need 

Theorem 4 (Holder's Inequality, see [16]) For continuous functions /, g : [a, 6] T 
R, we have: 



\f(t)g(t)\O a t< I \f(t)\ p O a t I \g(t)\ q O a t 

where p > 1, and q = -^y- 

We obtain 

Theorem 5 (Generalization of Holder's inequality) Let fi € C {[a 
l,...,n, and pi > 1 such thatY^i=i — 1- ^Tien 

i n n / „(, N 

ni/*(*)i0at<n / i/*(*)roat 



, l/j T , J 



»;=i 



(20) 



(21) 
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Proof. Using (20) and induction hypothesis, exactly as in [31]. ■ 

Comment 6 By Tietze 's extension theorem of General Topology we easily de- 
rive that a continuous function f o/TIi=i (i a ii h] H Tj) (where Tj, i = 1, ..., n e 
N are time scales) is bounded, since its continuous extension F on Yii=i [ a ii bi] Q 
K n is bounded, n G N. 

Comment 7 It is regarding the univariate functions. Based on [17] we see that 
the Cauchy Time scales delta A and nabla V integrals are equal to definite Rie- 
mann Time Scales A, V integrals, respectively. Thus, the diamond-a-Cauchy 
integral (14) is a diamond a-Riemann integral over continuous functions. Of 
course the last integral exists, since continuous functions are Riemann A and 
V -integrable, and it is equal to the corresponding a-Lebesgue integral, by [17]. 

In particular the dominated and bounded convergence theorems hold true with 
respect to the Lebesgue-A, V measures. 

Comment 8 Let Ti, T2 be time scales and f : [a, 6] T x [c, d] T — > R be con- 
tinuous. By [9] and [10] we get that f is Riemann A and V ' -integrable over 
[a,b)f 1 x [c,d)f 2 and (a,b]j 1 x (c,d]j 2 , respectively. Hence by [9], [10], f is 
Lebesgue A and V -integrable there. 
Thus by Fubini's theorem we get 

J I J f(x,y)Ay\ *x = j if f {x,y) Ax\ Ay, (22) 



and 



f if f{x,y)Vy\Vx = f I J f(x,y)Vx\v y . (23) 



We define (a € [0, 1]) 




f(x,y)<} a y <? a x 



<.j b (jy(x,y)Ay\A 




f(x,y)Wy)Wx. (24) 



One can generalize (24) for multiple integrals. 

So for f continuous we get the {} a -Fubini 's theorem main property: 

f(-r-!l)0n!l}0,,r= j | / f {:>: y) 0„.r } <> n! j. (25) 

We make 
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Remark 9 Let Ti, T2 be time scales and / : [a, b] T x [c, d] T — > R &e continu- 
ous. Consider 

g(x)= / f(x,y)<>ay 

pd pd 

= a f(x,y)Ay+(l-a) f (x,y)Vy, (26) 

^ c ^ c 

a € [0,1], V x G [a,6] Ti . 

We prove that g is continuous on [a, b] T . Let x n — > x, where {x n } ne jq, 
x G [a, o] T f/ien / {x n ,y) — > /(x,y), as n — > oo, V y G [c, d] T . Furthermore 
there exists M > swc/i £/ia£ |/ (x„, y)|, |/ (x, y)| < M, V y G [c, <i] T . Hence by 
Lebesgue's bounded convergence theorem ([9]) we get that 

lim f f(x n ,y)Ay = f f(x,y)Ay, (27) 

and 

lim J /(x„,y)Vy = f / (x, y) Vy. (28) 

n ^°°Jc Jc 

Combining (21) and (28) we obtain g (x n ) — ► g(x), as n — > 00, proving the 
continuity of g. 

For completeness we give 

Lemma 10 Lei $ : R™ — ► R increasing per coordinate and convex. We extend 
$ on R™ by defining $ (xi, ..., Xj, ..., x n ) := $ (a?i, ..., —Xj, ..., x„), /or any Xj < 
0, j G {l,...,n}. T/ien $ zs convex onMJ 1 . 

Proof. Let < A < 1, (x\, ...,x n ) , (y 1; ...,y n ) G R™. Then we observe that 

$ (A (xi, ..., x n ) + (1 - A) (|/!, ..., y„)) = 

$ (Axi + (1 - A) yi, ..., Ax„ + (1 - A) y„) = 

$ (|Axi + (1 - A) yi| , ..., |Ax„ + (1 - A) y n \) < 

$ (A |xi| + (1 - A) \ yi \, ..., X\x n \ + (1 - A) |y„|) = 

$(A(|x 1 |,...,|x Il |) + (l-A)(| 2/1 |,...,|y n |))< 

A$(|xi|,...,|x„|) + (l-A)$(|j/i|,...,|y„|) = 

A$ (x 1: ..., x n ) + (1 - A) $ (y 1; ..., y n ) , 

proving the claim. ■ 
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General Notation 11 Let (fii, Si, /i 2 ) and (Cl 2 , S2, /i 2 ) be measure spaces with 
positive a-finite measures, and let k : Qi x Q 2 ~^ R be a nonnegative measurable 
function, k(x,-) measurable on fl 2 and 

K{x) = f k(x,y)dn 2 (y), x e fii. (29) 

•M2 2 

We suppose that K {x) > a.e. on fii, and oy a weight function u (shortly: a 
weight), we mean a nonnegative measurable function on the actual set. Let the 
measurable functions gi : f2i — ► R, i = 1, ...,n, with the representation 

g t (x)= k (x, y) /; (y) d/u, 2 (y) , (30) 

Jn-2 

where fi : Q, 2 — ► R are measurable functions, i = 1, ...,n. 

Denote by~x = x := (xi,...,x„) e R™, ~g := {gi,...,g n ) and f := (/1, ...,/„). 

We consider here <I> : R™ — > R a convex function, which is increasing per 
coordinate. 

Example 12 (7or <!>,). 

-Zj Given gi is convex and increasing on R +; iften <f> (xi, ..., x n ) := £ i=1 $« (^j) 
is convex on R™ , and increasing per coordinate; the same properties hold for: 

•V IMIoc = ™ aX *» 
»e{l,...,nj 

5; Er=i (<■»?). 

^eiue;=i4 

<9j Zei <7j are convex and increasing per coordinate on R™ , t/ien so is E^=i e. 9i ^ x > , 
and so is In fejli e* (a: )) , x e R™ . 

General Notation 13 From now on we may write 

~g(x)= k(x,y) f (y)dfi 2 (y), (31) 

Jn-2 

which means 

(g 1 (x),...,g n (x))= / k(x,y)f 1 (y)dfj, 2 (y),-, k(x,y) f n (y)dfi 2 (y) 
\Ju 2 Jsi-2 

(32) 

Similarly, we may write 



7(*)\ 



k (x, y) f {y) d\i 2 (y) 
n 2 



(33) 
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and we would mean 



i\9l(x)\,-,\9n(x)\) 



k {x, y) /i (y) d\i 2 (y) 



k (x, y) f n (y) d\i 2 (y) 



We also can write that 



\g (x)\ < k(x,y) f {y) d/j, 2 (y), 

and we mean the fact that 

\9i(x)\< k{x,y)\fi{y)\dn 2 {y)i 
Jn 2 

for all i = 1, ..., n. Similarly for other properties. 



(34) 



(35) 



(36) 



General Notation 14 Next let (Cli, Ei,/^) and (Vl 2 ,T, 2 , /j, 2 ) be measure spaces 
with positive a-finite measures, and let kj : f^i x fl 2 — > R be a nonnegative 
measurable function, kj {x, •) measurable on Q 2 and 



Kj (x) 



kj(x,y)dn 2 (y) , x € Ct lt j = 1, ...,i 



(37) 



We suppose that Kj (x) > a.e. on Q\. 
Let the measurable functions gji : fl\ 



9ji (x) 



R with the representation 
kj {x, y) fji (y) d[i 2 (y) , (38) 



where fji : Q 2 — > K are measurable functions, i — 1, ..., n and j — 1, ..., m. 

Denote the function vectors gj := (gji,gj2, ■■■, 9jn) & n d fj '■— ifji,---,fjn), 
j = l,...,ra._^ 

We say fj is integrable with respect to measure \x, iff all fji are integrable 
with respect to \x. 



We consider here <&, 



l+, j — l,...,m, convex functions that are 



increasing per coordinate. 

Again u is a weight function on Q,\. 



2 Main Results 

We present vectorial inequalities on <^> Q - integral operators. 

Theorem 15 Let T 1; T2 be time scales, a, b G T 1; ' c,d € T 2 ; k{x,y) is a 
kernel function with x € [a, b] T , y € [c, d] T ; fc is continuous function from 
[a,b]j x [c, d] T into R.+ . Consider 

K(x):= / k(x,y)<) a y, V a; e [a, 6] Tl . (39) 
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We assume that K (x) > 0, V x £ [a, 6] T . Consider fi : [c, d] T — ► 
and the {} a -integral operator function 

9i 0) ■■= k (x, y) h (y) <> a y, 

Vie [M] Tl > i = l,--,"- Let~g := (#1, ...,#„) , / := (/i, ••■,/„)■ 
Consider also the weight function 



continuous, 



(40) 



u : [a,b] Ti ->M.+ , 



(41) 



which is continuous. 

Define further the function 



f b u(x)k{x,y) 
v(y):= i R{x) a x, 



(42) 



V y e [c,d] T2 . 

Let I denote any of (0, oo) or [0, oo) n , and $ : / — ► R &e a convex and 
increasing per coordinate function. In particular we assume that 



The 



u(x) $ 



l/il ([ c ^]t 2 ) ^ 7 > * = l,-,n. 

♦ a*< / «(y)*(|/(l3|)0al/- 



9{x) 



K(x) 



(43) 



(44) 



Proof. We see that 

b 



ii(x)$ 



g( x ) 



K{x) 



<} a x = / u (x) $ 



if (a;) 



' / u{x) ®\kJx) I k{x,y) fiy) 



k(x,y)f(y)O a y 



<> a y a x 



<> a X 

(45) 



(by generalized Jensen's inequality, see Theorem 2 and Comment 7) 
rb u(x) / '- d 



< 



K y , y i k (x, y) $ ( / (y) ) ^ Q y J ^ Q x 

'' / '■ d u{x)k{x 1 y) i 



(by (25)) 



a \ J c 



d ,-b 



K(x) 



-$ 



M( x)fc(x,y) 
^(x) 



$ 



/(2/)|)Oa2/jOaZ 

F(yj|)^aa;jO Q y 



(46) 
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*(|7(y)|) 



u(x)k(x, y) 
K(x) 



0*x)o a y= I w(y)*(|/(i/)|)Oai/ 



(47) 



proving the claim. ■ 
We continue with 



Theorem 16 All as in Theorem 15, however now $ is not necessarily increas- 
ing per coordinate and only from (0, oo) into R. Additionaly we assume that 
each fi is of fixed strict sign, i — 1, ...,n. Then 



" W MWTr a:C - w(y)*(/(y)JOaI/- (48) 



Proof. We notice that 

rd 



9{x) = 
Therefore we have 



k(x,y) f(y)<> a y 



k(x,y) f(y) a y. (49) 



u(x)<& 



g(x) 

K(x) 



<} a x = / u (x) $ 



if (a;) 



k(x,y) f (y)<> a y 



u (x) $ 



1 



K{x) 



k(x,y) f{y) <> a y <> a x 



<> a x 



(50) 



The rest follows as in the proof of Theorem 15. 
Corollary 17 (to Theorem 16) It holds 



J h u (x) J2 In (jfgf) OaX > jT v (y) ]T In (\f t (y)\) <> a y. (51) 

Proof. Apply (48) for <E> (x) — — Y^,7=i mx ii which a convex function with 
domain (0, oo) . ■ 



Corollary 18 (to Theorem 15) It holds 



u(x)J2e^^0 a x< v(y)J2^ Mv)l a y- (52) 

Proof. Apply (44) for $ (x) = YTi=\ eXi > Xi > 0. * 
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Notation 19 Let T±, T2 be time scales, a, b G Ti; c, d G T2; kj(x,y) is a 
kernel function with x G [a, 6] T , y € [c, <i] T ; &j is continuous function from 
[a, o] T x [c, d] T into R+ /or j = 1, ..., m € N. Consider 



(53) 



AT,- (a;) := / kj (x, y) ^ Q y ; V x e [a, b] Ti , 



j = l,...,m. 

We assume that Kj (x) > 0, V 1 £ [ a 7^T ' 3 ~ l,---, m - Consider fji : 
[c,ii] T — > M. continuous, i — l,...,n, and j — l,...,m, and i/ie § a -integral 
operator function 



9ji (%) : = / % ( x , y) /ji (y) 0«y, 



(54) 



V x e [a,6] Ti , i = l,...,n, j = l,...,m. 

Denote the function vectors gj := (gji, ...,gj n ) and fj :— (fji,---,fj n )> j 



1,...,TO. 



Consider also the weight function 

u : [a,b] Ti -*•»+, 



(55) 



which is continuous. 

Define further the function 

A m (y) := 



■ 0) IIJLi k J if, y) 

Il7=i K *W 



va^i 



(56) 



Here $., : R" — > K +; j = 1, ...,m, are convex and increasing per coordinate 



V y e [c,d] T . 

/wnctaons. 

We give 
Theorem 20 ^4/Z as m Notation 19. Let p € {1, ...,ra} fee /w:ed. TTien 



-wn $ i 



j=i 



Kj (x) 



<> a x < 



(57) 



in ,.d 



„_1 J C 



I 3=1 



a y 



*, 



/ P (y) A m (y) a y 
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Proof. We demonstrate the proof for m — 3. For general m it follows 
the same way. Here we use the multivariate Jensen's inequality, see Theorem 2 
and Comment 7, ^a-Fubini's theorem, see (25), and that $j are increasing per 
coordinate. 

We have 

3 



-wn $ i 



i=i 



9j (x) 
Kj (x) 



<> a X = 



■(x)f[*i 



j=i 



Kj (x) 



kj (x, y) fj (y)<> a y 



<> a x < (58) 



y) fj (y) 



<> a y <> a x < 



J u (x) II ( jfjx) J k i( x >y)®j (\fjty)\) <>*y) <> a x 



u (x) 



(calling 6 {x) 



n K i ( x 

u(x) 



3 ,.d \ 

II / k i ( x > y) ®j ( fi (v) ) <>°v <>°> x = 



J[k 3 (x) 
j = i 

J a Hx){f[J c k j (x,y)^ j (\f~ff)\)<> a y\<> a x 
f a 9 ^ I d (n^ *i(*.»)*i(|^I»)|)OaI/J 



*s(a:,y)*3 /s(») Oal/ 



♦<*£ 



f*[fy( x Ailf* k i( x >v)*i(ifiW 



<> a y 



d I r b I 2 d 



OaJ/ Oa^ 



^ ( x ) II/ M^)^(|/70/)|)^ 



(59) 
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h (x,y) $3 (|/a (y)\) Oax) <? a y = 



(60) 



$ a 



h{y) 



0(x)k 3 (x,y)( f { f fci(»,l/)*i( fi(y) 



c I J c 



OaV , 



(a;, j/) $ 2 ( / 2 (y) ) ♦ay) <> a x <? a y 



$3 



/ 3 (y) 



b / r d 



a \ J c 



<(x)k 2 (x,y)k 3 (x,y)$ 2 (\f2(y)\)- (61) 



fci (&, y) $i ( h (y) ) Q y I Oal/ <> Q a; 



♦ay : 



$3 



/ 3 (y) ♦ay 



b / r d 



a \ J c 



1 (a;) k 2 (x, y) k 3 (x, y) $2 h (v) 



fci (x, y) $1 ( A (yj ) ♦ay I ♦ay j <>«» 



$., 



/ 3 (y) 



♦ Q y 



d I rb 



c \ J a 



'(x)k 2 {x,y)k 3 (x,y)® 2 ( / 2 (y) 



fci (x, y) $1 ( fi(y) ) ♦ay > ♦aX j ♦ay 



(62) 



$3 



/ 3 (y) ♦ay 



(Jx ; 



My) 



$0 



h (y) ♦ay 



*i 



6{x)k 2 {x,y)k 3 {x,y)- 

fci (x, y) $1 ( My) ) ♦ay j ♦aX j ♦ay = 

3 

(*) II **(*>&)• 



$9 



/ 2 (y) 



b rd 



a \ J c 



3=1 



(\?nv)\) a y) Oax} a y 



$3 



/ 3 (y) ♦ay 



$9 



/ 2 (y) ♦ay 



14 



60 



ANASTASSIOU: VECTORIAL INTEGRAL OPERATOR INEQUALITIES ON TIME SCALES 



J (l ^(»)f[^-(*.y)*i(|/i(y)|)Oay)Oas 

11/ *i(/i(w) 

j=2 Jc 

f[ J* *s (\7i(v)\) <>*v 



3 ,d 



V J=2 

proving the claim 



II / *i ( £ (w) 

■_ n Jc 



<> a y 



$1 



Corollary 21 (to Theorem 20) It holds 






n / E ei/3i(s/)| o^ 






/ 



' rd ( n \ "N 



(63) 



/i(y)|)A3(y)0ai/ , (64) 



(65) 



Proof. Apply $, (x) = J^ILi e2:i > x * — ^' ^ or au J = ■*-' ■••> m - ■ 
We continue with 

Theorem 22 ^4// as in Theorem 20, but now $j : (0, oo)" — ► R + , j = 1, ...,m, 
are convex and not necessarily increasing per coordinate. Furthermore all fji, 
i = l,...,n, j = 1, ...,m, are of fixed strict sign. Then (57) is valid. 

Proof. Similar to Theorem 16, and Theorem 20. ■ 
We give the following application: 
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Corollary 23 All as in Theorem 22, with <frj (x) — — Y^i=i ^ nx ir J — 1, ••■, w € 
It holds 



(-1)' 



/ «wn E in 



^ (*) 



a a; < 



(66) 



m r d n 



(-i) T 









Proof. By (57). ■ 
We continue with 

Theorem 24 AZZ as in Notation 19. Define 



Uj (y) : = / u (a) 



fcj (g, 2/) 



Va*) 



V j/ e [c,d] T2 , j = l,...,m e N. 

Let pj > 1 : X)£Li ^ = 1. Then 



'3 = 1 Pj 



-wn $ i 



& (a) 



i=i 



^ (a:) 



OaCC < 



(67) 



(68) 



n 



(y)*j(fj(y)) 0«y) 



Proof. Notice that $j, j — l,...,m, are continuous functions. Here we use 
the generalized Holder's inequality, see Theorem 5. We have 



3 = 1 \ "■' 



9j 0*0 



Kj (x) 



<> a x 



/J «(^*j 



ft 0*0 



Kj (x) 



a x< 



(69) 



n / ^^ 

.7 = 1 \ Ja 



ft 0*0 



Pj 



^ 0*0 



O a x\ < 



(notice here that $^ J , j — 1, ...,m, are convex, increasing per coordinate and 
continuous, non-negative functions, and by Theorem 15 we get) 

i 

fd \ Pi 



n(/ «j(v)*i(\fj(v)\ro<*v 



(70) 
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proving the claim. ■ 
We also give 

Theorem 25 All as in Theorem 24, but now $j : (0, oo) n — ► R+, j = 1, ...,m, 
are convex and not necessarily increasing per coordinate. Furthermore all fji, 
j = 1, ...,m, i = 1, ...,n, are each of fixed strict sign. Then (68) is valid. 

Proof. Similar to Theorem 24, and by using Theorem 16. ■ 

Corollary 26 (to Theorem 24) It holds 



/ «wn 

Ja 3=1 


> 

9j {x) 

Kj (x) 


V] 


m / „d 

n / «i(») 


fj (y) 


Pi 

0«y 



(71) 



Proof. Apply (68) for $j (x) — ||x|| , xi > 0, i — l,...,n, j — l,...,m. 
Corollary 27 (to Theorem 24) It holds 



u(x)l[ l^e **<"> \0 a x< 



(72) 



n 



(») E« 



/'j 



J/ji (2/) I 



<> a y 



Proof. Apply (68) for $j (x) = Y^i=i eXi > x i > 0, i — l,...,n,for all j 
., m. m 
We need 



l,...,m. 



Definition 28 (75/j Let T k a time scale. Consider the coordinate wise rd- 
continuous functions h a : T x T — > R, a > 0, smc/i £fta£ /lo (£, s) = 1, 



K+i (t, s) = / ft a (t, s) At 



V s.teT. 
WhenT 



iften cr (t) = i, we define 

h a (t,s):= ^ ' a>0. 

1 (a + 1) 



(73) 



(74) 
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When T = Z, then a (t) = t + 1, t <G Z, and 

(t-s) {k) 
MM) = J , Vfc€N =Nu{0}, (75) 

Vi,seZ, where *W = I, i (fe) = ]J k ^ (t - i) for keN. Also it holds 

,-b 6-1 

/ /(t)At = ^/(t), a<&; a,6GZ. (76) 

Ja t=a 

We need 

Definition 29 (75/,) -For a > 1 we define the time scale A-Riemann-Liouville 
type fractional integral (a, b € T) 

Kf (t) = f h a -! (t, a (r)) / (r) At, (77) 

J a 

(by [11] is an integral on [a, t) n TJ 

^°/ = /, 

where f € L\ ([a, 6) n T) (Lebesgue A-integrable functions on [a, b) flT, see /1 7/, 

/&/, /io/;, te [a,b]r\T. 

Notice K\f (t) = J / (t) At is absolutely continuous in t G [a, 6] n T, see 
/Ji/. 

Lemma 30 ([3]) Let a > 1, f € L\ ([a, b) fl T). J/ additionally h a -i {s, a (t)) 
is Lebesgue A-measurable on ([a, b) fl T) , t/ien if"/ € £i ([a, 6) n T) . 

We need 

Definition 31 (75/J Assume T time scale such that T fe = T. 

Let /i > 2 : m — 1 < /i < m £ N, i.e. m — \/i\ (ceiling of the number), 
v = m — fi (0 < v < 1). 

Here we take f G C^([a,b] DT). Clearly here ([17]) f is a Lebesgue 
A-integrable function. Assume ho (s, a (£)) is continuous on ([a, 6] D T) . 

We define the delta fractional derivative on time scale T of order fj, — 1 as 
follows: 

A^7 (t) = (K +1 f Am ) (t) = f h~ v (t, a (t)) f Am (r) At, (78) 

Vte [a,i]ni 

Notice here that Aq* / G C([a,6] flT) by a simple argument using domi- 
nated convergence theorem in Lebesgue A-sense. 
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If /j, = m, then v — and by (78) we get 

A™- 1 / (*) = Kf Am (t) = f^ 1 it) . (79) 

More generally, by [11], given that f is everywhere finite and absolutely 

continuous on [a,b] flT, then f exists A-a.e. and is Lebesgue A-integrable 
on [a, t) n T, V t G [a, b] D T and one can plug it into (78). 

We need 

Definition 32 ([4]) Consider the coordinate wise Id-continuous functions h a : 
TxT^M,a>0, such that h (t, s) = 1, 

h a +i(t,s)= ( h a (T,s)VT, (80) 



v s,teT. 

In the case of T = R; then p(t) — t, and hk (t,s) — , , , k € No, and 
define 

h a (t,s):= ^~^ a>0. (81) 

1 (a + 1) 

Let T = Z, tfien p (i) = i-1, t € Z. Define fl := 1, t* := t (t + 1) ... (t + k - 1), 

keN, and by (80) we have h k (t, s) = ^^-, s,teZ, k e No- 
Here Jl Vr = E* +i ■ 

We need 

Definition 33 ([4]) For a > 1 we define the time scale V -Riemann-Liouville 
type fractional integral (a, b G T) 

Jaf(t)= I /Ja-l(i,p(T))/(T)VT, (82) 



(7>y /J^y i/ie Zasi integral is on (a, i] O TJ 

J°/ (*) = /(*), 

where f € L\ ((a, 6] D T) (Lebesgue V ' -integrable functions on (a, 6] n T, see /9/ ; 

[io],[i7]), te[a,b] nT. 

Notice J\f (t) = J / (r) Vr zs absolutely continuous in t € [a, 6] nT, see 

Lemma 34 f/^/) Let a > 1, f & L\ ((a,b] HT). // additionally h a -i (s,p(t)) 
is Lebesgue V -measurable on ((a, 6] n T) , i/ien J"/ € Li ((a, &] D T) . 

We also need 
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Definition 35 ([4]) Assume T^ = T. 

Let /i > 2 such that ra-l</i<m6N, i.e. m — \/i\, v = in — fi 
(0<v< I). 

Let f € C ; ™ ([a,b] flT). Clearly here ([17]) / v is a Lebesgue V -integrable 
function. Assume h^{s,p{t)) is continuous on ([a,b] flT) . 

We define the nabla fractional derivative on time scale T of order /i — 1 as 
follows: 

V^VW = U +1 f Vm ) (t) = fh~At,p(T))f m (r) Vr, (83) 

Vie [a,6]nT. 

Notice here that V^~ / € C ([ a > &] H IT) oy a simple argument using domi- 
nated convergence theorem in Lebesgue V -sense. 

If ji = m, then v = and 6j/ (#3,) we get 



i-i 



vr/w^if w = / v w. (84) 

More generally, by [1 1], given that / v is everywhere finite and absolutely 
continuous on [a,b] flT, then / v exists V-a.e. and is Lebesgue V-integrable 
on (a, i] n T, V t e [a, 6] n T, and one can plug it into (83). 

We present 

Theorem 36 Let T be a time scale, and a, b G T, a < &. wii/i a (a) = a. Let 
a > 1. /i a as in (73), and K" fi as in (77), where fi € L\ ([a, b) n T), % = 1, ..., n. 
ylsswme further that /i a _i (s, ct (£)) is Lebesgue A-measurable on ([a, &) D T) . 

Let 7 := (A, ...,/„) , *T«/ := (K°f u ...,K%f n ) . 

Call 



K*(x):=l X[a , x )(y)K-i(x,a(y))\Ay (85) 

\h a -i {x,a(y))\Ay, 



Vie ([a, 6] n T), where X[ a ,x) (v) * s ^ e characteristic function on [a, #) fl T. 
Assume that K* (x) > (delta) Lebesgue measure A-a.e. in x <E ([a, b] fl T) . 
Consider also the weight function 

u:([o,6)nT)^R + , (86) 

which is (delta) Lebesgue A-measurable. Assume that the function 

X ~* K* (x) Xla ' x) ^ ' /l " _1 ^' °" ^' 
is A-integrable on ([a, 6) n T) for each fixed y € ([a, 6) f~l T) . 
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Define v* on ([a, b) n T) by 



v*{y) 



u (x) 



J [{: ^X[a,x)(y)\ h a -l(x,(T(y))\Ax<O0. 

Let $ : WL — > K be a convex and increasing per coordinate function. Then 



(87) 



w(x)$ 



Kfi?) 



K*{x) 



Ax < 



(x)$(\7(xj\)Ax, (88) 



under the further assumptions: 



(i)7,*( 


7 


xe Qa,6)nT), 


(a) «*$ f 


7) 



are both \\ a x) (v) l^a-i (^j " (j/))| Ay-integrable, A-a.e. in 



is A-Lebesgue integrable. 



Proof. By [5], $ is continuous on R + . Here we use the multivariate Jensen's 
inequality, Tonelli's theorem and Fubini's theorem, all on Time Scales setting. 
Also we use that $ is convex and increasing per coordinate. We extend $ (x) 
on MP, see Lemma 10, so we can apply multivariate Jensen's inequality. 

Next we have 



Kfi {x) = / X [a ,x) (V) ha-i (x, a (»)) fi (y) Ay, V x e [a, b] n T, 

J a 

and 

\K°fi{x)\< f X [a , x )(y)\h a -i(x,a(y))\\f i (y)\Ay, i = l,...,n. 

J a 

We see that 

u {x) $ I — r^ 1 Ar "- 



X* (x) 



1 



(by multivariate Jensen's inequality) 



Ay } Ax < 



u (x) 



K 

b / rb 



^y lj a x [o ,x) (y) l^-i (*.*(y))l * (|/(y)|) 



Ay Ax 



a W a 



u (x) 



A"* 



7^y>C[o,x)(j/)l^a-i(a;,a-(j/))|$^ f (y) 



Ay Ax 
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(by Fubini's theorem) 
1-6 / rb u(x) 



a K*{x) 



X[ a , x )(y)K-i(x,a{y))\${ f{y) 



Ax Ay = 



$ 



f(v) 



6 u(x) \ 

K * i x \ X[a,x) (v) \ h a-i (x, o- (y))\ Ax 1 Ay 



$ 



/ (y) v* (y) Ay, 



completing the proof of the theorem. ■ 
The counterpart of last result follows: 

Theorem 37 Let T be a time scale, and a, b G T — {minT}, a < b, with p (a) = 
a. Let a > 1, h a as in (80), an d J" fj as in (82), where fi € L\ ((a, b] (IT), 
i = \,...,n. Let f := (/i,..., /„) , J". / := (J" /i,..., J" /„) . Assume further that 
h a -i (s, p (t)) is Lebesgue V -measurable on ((a, b](~]T) . 
Call 



k* (x) ■- / X[ a ,x) (y) h a-i i x ,p{y)) 



Vy 



(89) 



h a -i (x,p(y)) 



Vy, 



Vie ([ a :&] HT). Assume that K* (x) > (nabla) Lebesgue measure V-a.e. in 
xe ([a,6]DT). 

Consider also the weight function 



w : ((a,i]flT) -► R+, 

which is (nabla) Lebesgue V -measurable. 
Assume that the function 

X ~* K (x) X[a ' x) ^ ^ a_1 ^' P ^^ 

is V -integrable on ((a, 6] fl T) for each fixed y £ ((a, b] n T) 
Define v* on ((a, 6] fl T) fry 



(90) 



v* (y) 



U> (x) 



X[o,x)(y) K-i{x, p{y)) 



Vx < oo. 



(91) 



Let® 



I™ — > K fee a convex and increasing per coordinate function. Then 

rb 



w(x)$ 



■/»/(*) 



if* (x) 



Vx < 



a 



«4x)$(|7(^j|)Vx, (92) 
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under the further assumptions: 



ft) /,* 



./' 



\7y -integrable, V-o.e. in 



are both X[ a , x ) iv) K-i (x,p(y)) 

xe ((a,6]nf), 

(ii) «*$ ( / ) is W -Lebesgue integrable. 

Proof. Similar to the proof of Theorem 36. ■ 
We continue with 

Theorem 38 Let T be a time scale, and a, b G T, a < b, with a (a) — a. Let a > 
1, h a as in (73), and K"fji as ^ n C?V> where fji € L\ ([a, b) D T) , j = 1, ..., m € 
N, i = l,...,n G N. Let % := (/, 1; ..., j in ) , Kf/J := (K-f ju ...,K-f jn ), 
j = l,...,m. Assume further that h a ^\ (s,o~(t)) is Lebesgue A-measurable on 
([a,o)nT) 2 . Call 

K*{x):= j X[ a , x ){y)\K-i{x,<j{y))\Ay 

J a 

Vie ([a, 6] nT). issume t/iai K* (x) > 0, A-a.e. in x G ([a, 6] n T) 
_ffere £/ie weight function 

u: ([o,6)DT) ^R+, 

is Lebesgue A-measurable. Assume that the function 

\h a -i(x,o-(y))\ 



(93) 



X ~* U (x) X[a,x) (V) f - 



AT* (x) 

is Lebesgue A-integrable on ([a, b) D T) /or each fixed ?/£ ([a, 6) fl T) 
Define v* n on ([a, b) n T) 6j/ 

/ \h a -i{x,o-{y))[ 



v * m (y) •■= u (x) X[ a ,x) (v) f - 



Ax < oo. 



(94) 



K* (x) 

Let $j : R" — ► R+, j = l,...,m, are convex and increasing per coordinate 
functions. 
Then 



'Wii $ i 



^/j (») 



j = l 



K* (x) 



Ax < 



(95) 



f[ J^j (\W)\) ^y) N fc $i(|.My)|)<(y)A y V 

under the further assumptions: 

(i) fj>$j\ fj ) are both Xia,x)(y)\ h a-i( x ,°'(y))\ Ay -integrable, A-a.e. in 
x G ([a, &) n T) , j = 1, ...,m, and 



r»*;c*i(|K|), ^{\h\)> *3 (|K|) *« 

integrable. 



In 



are all A-Lebesgue 
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Proof. As in [5], and Theorem 20 here. ■ 
The counterpart of last result follows 

Theorem 39 Let T be a time scale, and a,b G T — {minT}, a < b, with 
p (a) — a. Let a > 1, h a as in (80), and J" fji as in (82), where fji G 

Li((o,6]nT), for j = l,...,m eN,i = l,...,n G N. Let% := (/,•!,...,/,•„). 

> .-. 

Jafj : = (JafjU—,Jafjn), j = l,—,Tn. Assume further that h a -i(s,p(t)) is 

Lebesgue V -measurable on ((a, 6] flT) . CWZ 



#* (») == / X[ ,x) (y) ^a-i i x ,p{y)) 



Vy 



Vie ([a, 6] n T). issme that K* (x) > 0, V-a.e. m x e ([a, 6] n T) . 
.ffere i/ie weight function 



w : ((a,6]flT) -> R+, 
is Lebesgue V -measurable. Assume that the function 

h a -i (x,p(y)) 



x->w (x) X[ a ,x) (y) 



K*(x) 



(96) 



is Lebesgue V -integrable on ((a, b] n T) /or eacft /ixerf y € ((a, o] H T) 
Define v m * on ((a, 6] n T) by 



v m * (y) ■= / w (x) x [a>x) (y) 



K-i ( x ,p(y)) 

Kjx) 



Vx < oo. 



(97) 



Let $j : R" — ► R+, j = l,...,m, are convex and increasing per coordinate 
functions. 
Then 



w (x) JJ $, 



JSfi (*) 



K* (x) 



Vx < 



(98) 



m r b 



fl / *; ( /i (») 

vJ=2' /o 



Vy 



$1 



/i(l/)|)v m *(y)Vy), 



under the further assumptions: 

ft) Tj, ^j(Tj) are both X[a , x ) (y) K_ x (x, p (j/)) 
x G ((a, 6] flT) , j = 1, ...,m, and 



(a) « m *$i 

integrable. 



h 



$, 



./2 



$., 



/3 



i ••■) ^m 



$, 



Vy -integrable, V-a.e. in 
I /m ) ! are afZ V -Lebesgue 
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Proof. As in [5], and Theorem 20 here. ■ 
We continue with 

Theorem 40 Let T be a time scale, and a, b G T, a < b, with a (a) = a. Let a > 
1, h a as in (73), and K" fji as in (77), where fji G L\ ([a, b) n T), j = 1, ..., m € 
N, i = l,..,n G N. Let J := (fji,...,f jn ), ^ := (K-f ju ...,K-f jn ), 
j = l,...,m. Assume further that h a ^\ (s,o~(t)) is Lebesgue A-measurable on 
([a,o)nT) 2 . CWJ 



(99) 



if*(a:):= / X[a,x) (i/) l^a-i ( x ^ (v))\ A V 

J a 

Vie ([a, 6] nT). issme that K* (x) > 0, A-a.e. iKf ([a, b] n T) 
_ffere £/ze weight function 



u: Qo,6)nT) ^M+, 
is Lebesgue A-measurable. Assume that the function 

\h a -i (x,a(y))\ 



X^U (x) X[a,x) (V) f - 



K*[x) 



is Lebesgue A-integrable on ([a, b) D T) for each fixed y G ([a, 6) n T) 
Define v* on ([a, b) n T) 6j/ 



li ( X ) 

w * (y) : = / ts* 73 X[ a ,x) (y) \ha-i (x, o- {y))\ Ax < oo 



K* 



Let Pj > 1 : YJLx 



convex and increasing per coordinate 
Then 



1. Let t/ie functions $, 



'Wii $ i 



^/i (*) 



j=i 



K*(x) 



Ax < 



(100) 
+, j = l,...,m, fee 

(101) 



II / w * (w) ^ ft (y) 

.7 = 1 "^ 



Pj 



Ay 



under the further assumptions: 



r*; /^- 



are fcott X^,^) (y) l^-a-i (^: °" (y))l Ay-mie^ra^Ze, A-a.e. 



in x G ([a, 6) n T) , /or a/? j = 1, ..., m, and 

v Pj 



(*»; w*$j 



/, 



is A-Lebesgue integrable, j = 1, ...,m. 



Proof. As in [7], and Theorem 24 here. 
The counterpart of last result follows. 
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Theorem 41 Let T be a time scale, and a,b G T — {minT}, a < b, with 
p (a) = a. Let a > 1, h a as in (80), and J" fji os in (82), where fji £ 
Li( (g,6]nT), $ = l,...,n e N, j = l,...,m € N. Lei /J := {fji,...,f jn ), 
Jafj '■= {Jafjii---iJafjn), j = 1, ...,m. Assume further thai h a -i {s,p{t)) is 
Lebesgue V -measurable on ((a, 6] nT) . Ca/Z 



K*(x):= X[ a ,x)(y) h a -i(x,p(y)) 



Vy, 



Vie ([a, 6] nT). issume i/iai X* (a;) > 0, V-a.e. m a; € ([a, 6] flT) . 
_ffere i/ie weight function 



w : ((o,6]nT) ->R+, 
is Lebesgue V -measurable. Assume that the function 

h a -i(x,p{y)) 



X^W (x) X[a,x) (V) 



K*(x) 



(102) 



is Lebesgue V -integrable on ((a, 6] n T) for each fixed y € ((a, 6] n T) 
Define v* on ((a, 6] fl T) by 



v* (y) 



W) (x) 



a K* (x) ' 



X[a,x)(y) K-i{x, p(y)) 



Vi < oo. 



(103) 



Let pj > 1 : X^^Li = 1- ^ e ^ ^ e functions <&j 
convex and increasing per coordinate. 
Then 



j=i 



Jif* (as) 



Va;< 



-, J = 1,-..,to, &e 



(104) 



m „6 

n / w * (w) $ j 

.7 = 1 ,/a 



/i (y) 



Vy 



under the further assumptions: 



ft) fi^j 



fi 



are both X[ a , x ) (v) h <*-i(x,p(y)) 



Vy -integrable, V-o.e. 



m x G ((a, 6] n T) , for all j — 1, ..., m, and 

(U,) V*$j ( /j ) is V -Lebesgue integrable, j — l,...,m. 



Proof. As in [7], and Theorem 24 here. 
We give 
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(105) 



Corollary 42 (to Theorem 38) It holds 

Ja j = l \i=\ I 

m r b n \ / „6 / n \ \ 

TJ^ £ e l/^)l A J U r£e^yAv* m (y)A y y 

under the assumptions: 

ft) /j;E"=i el/j * to)l are X[ a ,x) (v) \ha-i {x,a(y))\ Ay-integrable, A-a.e. in 
x G ([a, b) flT) , j = 1, ...,m, and 

(^ < (E"=i e l/li(!/)l ), £" =1 d^^l, j = 2, ...,m, are a/Z A-integrable. 



Corollary 43 (to Theorem 41) It holds 

t 
3=1 \i=l 



rb m / n |.j<*/.. (x )| \ 

/ w (x) Y[ [J2 e «&i Vx < (106) 






Vy-integrable, V- 



under the assumptions: 

(i)T3,(Y:ti^ fMv)l ) Pi are both X[a , x) (y) h a -i(x,p{y)) 

a.e. in x G ((a, b] D T) , /or a/? j = 1, ..., m, and 

(ii) v* (y) (E"=i e'f ji ( v >') 3 is V -Lebesgue integrable, j — 1, ...,m. 

We continue with 

Theorem 44 Lei T 6e a time scale, and a, 6 G T, a < b, with a (a) = a. 
Let all as in Definition 31 and for fji G C^([o,6]nT), j = l,...,m* G N, 

i = l,...,n G N; fo as *n (7*;. Let J*™ := (/jf\ ..., /£") , A^ 1 /, := 

[A^Z 1 fji, ■■■, A^Z 1 fjn) , j = l,...,m. CaH 

#i (s) == / X[a,x) (y) Ife (*. * (y))l A ?/ (107) 

^ a 

V .x G ([a, 6] Hi), issme i/iai i^i (a;) > 0, A-a.e. in x G ([a, 6] flT) . 
ffere the weight function 

u: ([a,b)r\T) -^R+, 

is Lebesgue A-measurable. Assume that the function 

\hu{x,o-(y))\^ r " 



X^U (x) X[a,x) (v) f 



K x {x) 

27 



73 



ANASTASSIOU: VECTORIAL INTEGRAL OPERATOR INEQUALITIES ON TIME SCALES 



is Lebesgue A-integrable on ([a, b) fl T) for each fixed y € ([a, b) fl T) 
Define tp on ([a, 6) fl T) 0?/ 

^ (y) - ["u{x)x M (y) ( ^t>° m T Ax < oo. 



(108) 



Here $, : 
functions. 
Then 



KUx) 
+ , j — 1,...,™*, are convex and increasing per coordinate 



b m, 

u (x) JJ $j 



^a* Jj V^V 



m, „t 



n / *i 



ft (y) 



Ay 



K!(x) 



$j 



Ax < 



(109) 



/i (y) 



f m , (y) A y , 



under the assumption that p m is A-Lebesgue integrable on ([a, b) n T) . 

Proof. By Theorem 38. ■ 
We also derive 

Theorem 45 Let T be a time scale, and a, b G T — {minT}, a < b, with p (a) = 
a. Let a// as in Definition 35 and for fji e C ; ™ ([a, b] fl T), j = 1, ..., to* e N, 

i = l,..,n e N; ftp as zn A50J. Let ^ v ™ := (/J\ -, /jT) , V^ 1 /, := 
(V^Vii.-.V^Vjn), J - l,...,m. tfa« 



K 2 (x):= X[ a ,x)(y) hv(x,p(y)) 



Vy, 



(110) 



Vie ([a, 6] nT). issume that K 2 (x) > 0, V-a.e. mx e ([a, 6] n T) . 
.ffere t/ie weight function 



w : ((a,i]flT) -> R+, 
is Lebesgue V -measurable. Assume that the function 

%{x,p{y)) 



x^w(x) x [a , x ) (y) 



K 2 {x) 



is Lebesgue V -integrable on ((a, 6] n T) for each fixed y <E ((a, 6] n T) 
Define ?p on ((a, 6] n T) by 



ijj (y) :-- 



w (x) 



a K 2 (x)' 



X[ a ,x)(y) hv(x,p(y)) 



Vx < oo. 



(Ill) 
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Let pj > 1 : Y^/T=i ~ — 1- ^ e ^ ^ e functions $j : R™ 
convex and increasing per coordinate. 
Then 



( 






v^ViW 



\ 



^2(») 



Vx < 



., j = 1, ...,m*, &e 



(112) 



n / ^(»)*j 

vj=l' /a 



// (») 



/'j 



Vy 



under the assumption that ip is V -Lebesgue integrable on ((a, 6] D T) . 

Proof. By Theorem 41. ■ 
We finish with 

Remark 46 (£o Theorem 15) 

(i) Let Ti = R, T 2 = Z; < a < 1. Then 



6 /.b 

•^ Q x = / -dx, 



and 



/ ■0atf = o5^- + (l-a) 51 

•Jc . . i i 



s/=c y=c+l 

Assume k : [a,b] x [c, d] z — ► R+, a continuous function. So here 

d-1 d 

K(x) = a^2k(x,y) + {l-a) ^ fc(x,y)>0, 

y=c 2/=c+l 

V .x € [a, 6] , and 

/i : [c, d] z — > R, mift 

d-i d 

ffi (a;) = a ^ k (x, y) fr (y) + (1 - a) ^ k(x,y)f l (y) 

y=c V=c+1 

V x € [a, b] , i = 1, ...,n. 

-ffere u : [a, 6] — > R + continuous, and 



v{y) 



u(x)k(x,y) 
K(x) 



dx, 



V y e [c,d] 2 



(113) 



(114) 



(115) 



(116) 



(117) 
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Let $ : R" — ► R convex and increasing per coordinate function. 
(4-4), we obtain 

u(x)& 



K(x) 



dx < 



Then, by 
(118) 



d-1 



E v (y) $ (|/^)|) + (!-«) E «(y) $ (|7Iy)|)- 



t/=c 



J/ = C+1 



(ii) Let Ti = Z, T 2 = R. TTiera 



„h fe-l 6 

/ -O a a; = a^-H-(l-a) ^ •, 

" a x=a x=a+l 



and 

/d />d 

■<> a y = / -dy. 

Assume k : [a, 6] z x [c, d] — ► R+, a continuous function. So here 

,d 
K(x)= / k(x,y)dy > 0, Vx <G [a, 6] z . 



Consider fi : [c, rf] — ► R continuous and 

i-d 



9i(x)= k{x,y) ft{y)dy, Vx e [a,b] z , i = 1, ...,n. 
Let u : [a, 6] z — ► R + . Lfere «£ is 

Y^ u(x)fc(x,y) ^ u{x)k(x,y) 



if (x) 



^ K(x) 

x=a+l v ' 



VyeM. 

Lei $ : R™ — ► R convex and increasing per coordinate function. 
(44); we derive 



b-l 



:Y^ U (x) $ 



sW 



if(x) 



+ (1 - a) V" u(x)$ 

a:— a+1 

«(»)*( 7(y) ) <*». 



sW 



#(x) 



< 



(119) 



(120) 



(121) 



(122) 



(123) 



Then, by 



(124) 
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3 Appendix 

We give 

Proposition 47 Let t n ,t,T € {[a,b] flT), T is time scale, and t n — > t, then 
X[a,t n ) ( T ) ~ * X[ a ,t) ( T ); ^e^ a Lebesgue measure A-a.e. in t. 

Proof. Indeed we have the cases: 

A) t„ < t. 

We distinguish the subcases: 
Al) a < t < t n , then X[ a ,t n ) ( r ) = X[ a ,t) ( T ) = 1, 
A2) t < t < b, then X[a ,t n ) (r) = X [a ,t) to = 0, 

A3) t n < t < t, then X[a,t n ) ( T ) = 0' but X[o,t) ( T ) = 1' an d convergence fails. 
As n — ► oo, the delta Lebesgue measure /i A ([£„,£)) = (t — t n ) — > 0. Thus 
X[o,t n ) ( r ) -*■ X[a,t) ( T )> A-a.e. in r, when t n -*■ t from the left. 

B) t < t n . 

We distinguish the subcases: 
Bl) a < t < t, then X[ a ,t n ) ( T ) = X[a,t) ( T ) = 1 , 
B2) t n < r < b, then X[a ,t n ) ( T ) = *M) M = °> 

B3) t < t < t n , then X[a,t„) ( T ) — 1; but X[ a .t) ( T ) = 0' an d convergence fails. 
As n -> oo, ^ A ([*,<„)) = (*„ - 1) -> 0. Thus x [a ,t„) CO -> X[a,t) ( T )> A-a.e. 
in r, when £„ — > i from the right, proving the claim. ■ 

Proposition 48 Let y n ,y,x € ([«,&) Hi), T is time scale, such that y n — ► y, 
as n — > oo. TTien X( Vn ,b) ( x ) ~ * X(t,.b) ( x ); A-a.e. in x € [a, 6) n T. 

Proof. Indeed we have the cases: 

A) Case of y n < y, we distinguish the subcases: 
Al) if a < x < y n , then X( Vn ,b) ( x ) = X( y ,b) ( x ) = °> 
A2) if y < x < b, then X( Vn ,b) ( x ) = X( y ,b) ( x ) = *> 

A3) if y n < x < y, then X(y n ,b) ( x ) = 1> but X(y,&) ( x ) = 0' an d convergence 
fails. 

We observe that (see [17]) < /x A ((y„, y]) = a (j/) - a (y„) = y - a (y n ) < 
V - Vn -*■ 0, proving X(y n ,&) (») -" X(j,,6) ( x )> A-a.e. in x e [a, &) n T, when 
TJn — * y from the left. 

B) Case of y < y n , we distinguish the subcases: 
Bl) if a < x < y, then X( Vn ,b) ( x ) = X( v ,b) ( x ) = °i 
B2) if y n < x < b, then X( Vn ,b) ( x ) = X{ y ,b) ( x ) = h 

B3) if y < x < y n , then X( Vn ,b) ( x ) = °> but X( y ,b) ( x ) = ^ and convergence 
fails. But (by [17]) we have fi A {{y,y n \) = <J (y n ) - a (y) = er (y n ) - y -> 0, 
since a is rd-continuous and y is a right dense point ([12]). So we proved that 
X( Vn ,b) ( x ) ~* X(y.b) ( x )> A-a.e. in x e [a,b) fl T, when y n — > y from the right. 
That is proving the claim. ■ 
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DIFFERENCE SCHEME FOR SOLUTION OF THE DIRICHLET'S 

PROBLEM 

GALINA MEHDIYEVA AND AYDIN ALIYEV 



Abstract. In present work for numerical solving Dirichlet's problem for Lap- 
lace's equations, there is applied difference scheme of higher accuracy order, 
allowing getting estimate of the error of order 0(h 4 ), only known data take 
part in this estimate. And this, in its turn, allows applying computational 
techniques to estimating the error and getting concrete error values. 

Representation of difference problem (five-point scheme) solution for Lap- 
lace's equation on rectangle with aid of discrete analog of Fourier method, 
suggested and grounded in [2], is the main tool for creating its solutions both 
on whole net domain [4], [5], and on net segments [6], and what is more, not 
only rectangular, but and on multistcp domain as well. Besides, error estimates 
(of second order regarding net step h), inferred in [2], are effective, i.e. depend 
only on known quantities. 

In works [4], [5], though representation of nine-point scheme solution is 
used, but its convergence to solution of the differential problem is not grounded. 

Effective estimates for nine-point scheme, not representing the solution by 
Fourier method, are obtained in [7]. 



1. Introduction 

Let us denote through II a rectangle with vertices (0, 0), (1, 0), (1, b), (0, b), where 
6-rational number. Let T-boundary of this rectangle. 

We introduce net square by lines x = Xi — ih, y — yj = jh (i = 0, 1, ..., l/h,j — 
0, 1, ..., b/h), where 1/h and b/h - integer numbers. Let 

Ilh = {(x,y) : x = Xi = ih, i = 0, 1, ..., l/h,y = yj = jh, j = 0,1, ..., b/h} , 

and I\- set of net knots, lying on T. Let, further T\ h — {(x,y) : x — ih, i — 
l,...,n, y = 0}. 

Consider Dirichlet's problem 

Am = on II, , . 

«lr = /> ( ' 

where / - defined on T and has fifth derivative on each side of T. 
Through A^u denote Laplace's nine-point difference operator: 

A/jU — u{x + h,y + h) + u{x — h,y + h) + u(x — h,y — h) + u(x + h,y — h)+ 

+A[u(x + h,y) + u(x, y + h) + u{x — h,y) + u(x, y — h)] — 20u(x, y). 
Through Q(x, y) denote special polynomial of fourth degree. 

Q(x, y) = a 3 ix 3 y + a 30 x 3 + a 13 xy z + a Q3 y 3 + a 2 ix 2 y+ 



Key words and phrases. Estimate error, Fourier method, difference scheme. 
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+ai 2 xy 2 + auxy + a 20 x 2 + a 02 y 2 + a w x + a 01 y + a 00 , 
where 

aoo = /(0, 0), ooi - /(0, b) - -f^O, b) - ^(0, 0) - /(0, 0), 

«02 = ^(0,0), a 03 - ^^(0,6) - ^/;;(0,0), 

oio - /(1,0) - ^(1,0) - ^(0,0) - /(0,0), 
o 3 

«n - \f"(.l,b) - l b fL(lM + 1/JJhO) + 1/^(0,0) - l/J x (0,b) ^/^(1,6)+ 
+ ^O > & ) - ^„(1.0) - ^C(°'°) - \f"i 1 ^) /(0,6) + /(0,0), 



-f" fl 0) - - 



ai2 = ^/^(l,0)--/^(0,0) 

«i3 - ^.C(M) - ^OM - ^/ w (l,0) + t^i/™(0,0), 



« 20 = */" x (0,0), 021 - ^/^(0,6) - ^/x,(0,0), a 30 = */"*(!, 0) - */^(0,0), 
«3i = ^/xx(1.6) - ^/xx(l.O) + ^/xx(0,0) - ^/^ x (0,6). 

2. Results 

It is easy to check that Q(x, y) is a harmonic function on the vertices of the 
rectangle and the following conditions: 

nt\- t (\ d 2 Q( P ) _ d 2 f( P ) d 2 Q( P ) _ d 2 f( P ) 
wip; /w, aa;2 aa , 2 , dy2 dy2 ■ 

If we replace boundary function / with tp h — f — Q, then we receive new problem, 
for which the estimate error is the same as for our problem. As estimate of the 
error can be represented in view of sum of four members, each, of which correspond 
to boundary quantities, equal to zero, except for one of four sides of the rectangle. 

Through tp h denote assigned function on T^: 

_ J 0, on vertixes II, 

Vh = I / - Q, v = 0. 

So, the following difference scheme is considered: 

A h u h =0on II/j, 

<Ph on T lh , } (2.1) 



'"' _ ' o on r fc \r lh 

It can be easily verified that the solutions of problems (1.1) and (2.1) are defined 
accordingly by formulas 

oo 

u(x, y) = ^ c n g(y, nir) sin nirx, (2.2) 

71=1 

1/h 

Uh(x,y) = ^2j n g(y,f3 n /h)smnirx, (2.3) 
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where 



c n = 2 / ip h (t) sin nirtdt, 
o 

l/h 

r ) n = 2hy tp h (rh) sin rnrrh, 



(2.4) 



and j3 n is denned from 



g{y,z) = 

-4- 



s/i(6 — y)z 



shbz 
sinnhir/2 



1 - | sin 2 nhir/2 



The solution of difference scheme (2.1) will be taken as an approximate solution 
to (1.1) at the nodal points. To evaluate the accuracy of the method, we first prove 
some properties of j n , C n and g. 

In order to estimate the approximate values of the Fourier coefficients C n , we 
integrate by parts five times in the right member of the formulas for C n and obtain: 



C„ 



2 / (/ - Q) sinmrtdt = 2 

o 



if-Q) 



cos nirt 



cos nirt 



(/' - Q')dt 



1 



//,T 



(/' 



,. sinn7ri 



1 f sinnirt 



if" - Q")dt 



= 2 



= 2 



_J^ cosmrt 

nir nir 



cos n7rt 

(i (nir) 3 J //,: 
o 



(f'"-Q'")dt 



' ( f " - Q'") sinwrf 

nir 



(nir) 3 

1 



sin n7ri 

(nir) A J irn 
o 



(/ (JV) _ Q(IV) )dt 



(nir) 1 



_(f(IV)_ Q (IV) ) 



cos nirt 



o ( n7r ) 5 



(n7r) 5 
It follows that 



(-i)"~7 (/y) (i)-/ (/y) (o) 



(nirY- 



f {V) (t) cos mrtdt 



f (v) (t) cos nirtdt. 



\C n \ ^ Kn~\ 
where 

Jf=4[/Cv)(l) + /Cv)(0)l+4m 

7T L J 7T 

Further, we note that ([5]) 

7„ = C„ (n = 1,2, ...,1/n). 
Then it is necessary to estimate the difference 



(2.5) 



/ (y) (t) 
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The expression %■ = arsh— t= sm[ - n l7V ' > can be expanded in a Taylor series in 

-^1-2/3 sin 2 (ri/i7r/2) l J 



powers of nInr/2, we obtain 



P„ nhir fnhir\ 



where 



R = — max 

5! o^xs;^ 



l {v) {x) , l{x) = arsh 
Hence, given R ^ 1/30 that, we obtain: 



smi 



1 — | sin x 



\^-nhM<^^- (2-6) 



Further 

-sh~ bz {(26 — y)shyz — ysh(2b — y)z} . 



dg(y,z) 1 , .,, 



dz 2 

Given that sht/t is a non-decreasing function of t > 0, we have: 

(26 — y)shyz $i ysh(2b — y)z. 

Therefore, 



9g(y, z) 



% ) shr 2 bzsh(2b-y)z, z > 0, y > 0. (2.7) 



9.2 

For this expression we use the obvious inequality 

shkt ^ exp((fc - l)t)s/ii for ^ k s£ 1 
and 

sM ^ -(1 -exp(-2ii))exp(t), t > *i > 0. 

We also need the inequality 

4n /3 1 

— < — s$ V3n7r for 1 < n < -. 

3 /i n, 

The right-hand side of this inequality follows from the expression /3 n /h, and the 

left part can be proved as follows: 

1 ^ , 1 = ^ y/3, 

<\-\ sin 2 ^f 

, B„ nhir sin ^^ 1 nhir 2 
sh-^- = — j-^ = ^ — = nh 

2 2 ^ yd - § sin 2 ^ 2 - 

and 

S/1 | < v^ < |V3. 
Given that shx/x non-decreasing function, we have: 
sh^ sh[arsh^^/3] fv^3 







Hence 



< — " ^^ = — 7= ^ "• 

— ars/i^VS ars/i|V3 2 



In \ " i Pn \ 2 , 

— ^ -s/i — ^ -n/i. 
2 2 2 3 
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Taking this into account in (2.7), we obtain: 
dg(y, z) 



^in 1 -^^ 



^ I 1 — exp 



exp(— 2bz) cxp(— yz)sh2bz ^ 



yexp(-yz), 



for 



Using (2.6) we obtain: 



\/3n7r ^ z ^ ^ ^ |n, 
Hn^ 1/h, 



\g(y,(3Jh)-g(y,mr)\ $ (l - exp (-f )) 2 ycxp(-yz) h 4 ^^ 



^(l-exp(-f))- 2 ^exp(- 



4njA -.5 1,4 



Completing the preliminary observations by the inequality 

which is a consequence of the definition g(y, z). 

Now we can evaluate |u — Uh\- We write, using (2.2) and (2.3) 

|u - Uh\ < Ri + -R2, 

1/h 

Ri=J2 \c n \\g{y,rm) - g{y,PJh)\, 



71=1 

#2 = 



E \cn\g(y,mr). 

n=l+l/h 



From (2.5) and (2.9) 

OO 

n=l+l/h 

In order to assess R\, observe that by (2.5) 

|c„K Kn- 5 . 
Using this and (2.8), we obtain 



2 ' 



1//1 



|i2i| ^ K^n-Hl 



exp 



480 



yexp 



3, 



^ X yK 1 1 - exp 

480 y ! 



480" 
86 

y 



n=l 



exp 



4y 



Aftc 



we get 



(exp(f)-l) 



^ - for y ^ 0, 



i?! ^ X ( I - exp 

1 640 ' ' 



■f" "* 



iF) "^ 



4ny\ 



(2.8) 



(2.9) 
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So, 



\u-u h \ ^K{0,5 
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ASYMPTOTIC DISTRIBUTION OF VECTOR VARIANCE 
STANDARDIZED VARIABLE WITHOUT DUPLICATION 

ERNA T. HERDIANI AND MAMAN A. DJAUHARI 



Abstract. In recent years, the use of vector variance standardized variables as 
a measure of testing equality of correlation matrices has received much atten- 
tion in wide range of statistics. This paper deals with a more economic measure 
of testing equality of correlation matrices, defined as vector variance standard- 
ized variables minus all duplication elements. For high dimensional data, this 
will increase the computational efficiency almost fifty percent compared to the 
original vector variance standardized variables. Its sampling distribution will 
be investigated to make its applications possible. 



1. Introduction 

The stability of correlation matrix is one of the most important problems in eco- 
nomic development and financial industry. We can see in the literature that, since 
the last decade, that problem can be found in a wide spectrum of applications. Its 
applications in property, real estate and asset businesses can be seen, for example, 
in eichholtz (1995), Lee(1998), Cooka el al. (2002) and Fischer (2007). Other appli- 
cations such as in equity market, global market and risk management are presented 
by Meric and meric (1997), Tang (1998), Gandc and Parsley (2002), Annaert et 
al. (2003), Ragea (2003), Da costa et al. (2005) and Goetzmann et al. (2005), 
Michelle et al. (2010). We can also find its application in parallel computation of 
high dimensional robust correlation matrices in Chilson et al. (2006). 

Hypothesis testing about the stability of correlation structure is one of the fun- 
damental issues in multivariate analysis. It is usually realized based on likelihood 
ratio test (LRT). See, for example, Box (1949), Lawley (1963), Fullback (1967), 
Aitkins (1968), and Jennrich (1970). In this research area is to have a better un- 
derstanding of the covariance matrix of sample correlation matrix. Several research 
have been realized in this area. They were Browne and Shaphiro (1986) have initi- 
ated to study the asymtotic behavior of the covariance matrix of sample correlation 
coefficient. Its followed by Neudecker and wesselman (1990) and Neudecker (1996) 
who have reported the asymtotic behavior of the covariance matrix of sample cor- 
relation matrix. Its result used to testing the stability of correlation matrix when 
sample correlation matrices are depence, Schott (2001, 2007). 

In this paper, we consider the case where sample correlation are independent. In 
this case, M statistic of Box (1949) and J-statistic of Jennrich (1970) used statisti- 
cal test for testing the stability of correlation matrix. However, their application in 
practice is not without limitations. M statistic as computation of matrix determi- 
nant and J statistic involves matrix inversion. The former needs the condition that 



Key words and phrases. Asymptotic distribution, correlation matrix, likelihood ratio test, 
vector variance standardized variables, vector variance. 
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all sample correlation matrices are positive definite which is not always satisfied in 
practice. This condition is not apt for high dimension data sets because its com- 
putational efficiency becomes low. To handle this obstacles, Djauhari and Herdiani 
(20f0) proposed a new statistical test based on what we call vector variance of 
standardized variables (VVSV). 

The proposed test is constucted based on vector variance (W). The role of 
VV could be increasing the computational efficiency of fast minimum covariance 
determinant (FMCD) algorithm, Herwindiati et al. (2007). Let P be the correlation 
matrix of the population under studi. Vector Variance of Standardized Variable 
(VVSV) is the trace of the squared correlation matrix population, i.e. VVSV — 
Tr(P 2 ) — \\vec(P)\\ 2 . It is the sum of square of all elements of P. The fact 
that P is symmetric, it is no need to involve all elements of P. The element 
of its upper (lower) triangular matrix are sufficient. This is what we want to 
discuss in this present paper. The rest of the paper is organized as follows. In 
section 2, the problem formulation will be presented. Later on, in section 3, we 
discuss the asymptotic distributional properties of vector variance of standardized 
variables without duplication or modified vector variance of standardized variables 
(MVVSV), i.e. VVSV without all duplicated elements. Our approach will be based 
on the notions of vec operator and commutation matrix. Additional remarks in 
section 4 will close this paper. 

2. Problem Formulation 

Let X is a random vector p dimension with definite positive covariance matrix E. 
By using vec operator, see Muirhead (1982), El Maache and Lepage (1998), Schott 
(1997, 2001) and Djauhari (2007), vector variance (VV) of X is simply ||vec(E)|| 2 . It 
is a multivariate variability measure like Generalized Variance (GV). See Djauhari 
(2007) for an in depth discussion on VV and its asymtotic behavior. In what follows 
our attention will be focused on the case where all variables are standardized. Let X 
is a random vector p dimension with definite positive covariance matrix E. By using 
vec operator, see Muirhead (1982), El Maache and Lepage (1998), Schott (1997, 
2001) and Djauhari (2007), vector variance (VV) of X is simply ||vec(S)|| 2 . It 
is a multivariate variability measure like Generalized Variance (GV). See Djauhari 
(2007) for an in depth discussion on VV and its asymtotic behavior. In what follows 
our attention will be focused on the case where all variables are standardized. 

Let Z be the random vector where its k-th component is the standardized version 
of the fc-th component of X; k — 1,2,..., p. The covariance matrix P of Z is the 
correlation matrix of X. We call the parameter ||-yec(P)|| 2 vector variance of the 
standardized variables (VVSV). Now, let Z\, Z<2, ■■■, Z n be a random sample of size 
n from Z with covariance matrix P. If R is the sample correlation matrix, then we 
call \\vec(R)\\ 2 sample VVSV. 

Let R be a symmetric matrix, there are ~ elements of R which are dou- 
bly counted in |)wec(i?)|j 2 . This is the first problem that we want to discuss in 
this present paper. More specifically, instead of using the vec operator, we pro- 
pose to use further operator which will transform the lower triangular part Rl 
of R into the vector v(Rl) of dimension ^ p „ >p by stacking its column one after 
another. From now on we call the parameter ||w(_R^|| 2 vector variance standard- 
ized variables without duplication or simply modified vector variance Standardized 
Variables (MVVSV). 



ASYMPTOTIC DISTRIBUTION OF VVSV WITHOUT DUPLICATION 3 

It is clear that MVVSV is more economic than VVSV. The second problem is 
to investigate the distributional properties of sample MVVSV. This will guide us 
to a more economic hypothesis testing about the stability of covariance structure 
mentioned in section 1. 

The solution for the first problem is given by schott (1997). Let v,ij be a vector 
dimension ^ p ~ ' p defined as follows. The {(j — \)p + i — 3 ^ ' } — th component is 
equal to 1 and otherwise; i — 1, 2, ...,p and j — 1, 2, ..., i — 1. Let also Eij be a 
matrix of size (p x p) its (i,j) — th element is equal to 1 and otherwise. Schott 
(1997) gives us the following result. 



(2.1) L\ = £>ec(£y)R 

i>j 



and 



(2.2) v(R L ) = L p vec(R). 



Example 2.1. We will illustrate how to transform A is a square matrix or a 

(an ai2 013 \ / 1 «21 031 

021 «22 023 = ^21 1 032 

O31 032 033 / V fl 31 a 32 1 

if A is a correlation matrix, where A is a symmetric matrix, aij — a^ for i =/= j 
and an — «22 = 033 — 1, we will determine of v(Al). 



\u% 



i>j 

L\ = {vec(E 2 i)}u t 21 + {vec(E 31 )}uli + {vec(E 32 )}ul 2 
L\ = {vec I 1 I } I 



l>3 

t 



0/ \ 

\ / 
+{vec I } 1 

1 / \ 

\ / 

^{vec I j } 

10/ \ 1 
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\0 0/ 



/0 10000000\ 

001000000 

000001000 
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\0 0/ 



/ 



Hence 



v(A L ) = L p vec(A). 



010000000 

v(A L ) =(001000000 
000001000 



f 1 1 




«21 




031 




«21 


1 O21 


1 


= 031 


032 


\ «32 


a-31 




a-32 




K 1 J 





3. Distributional Properties of Sample VVSV without Duplication 

Let Xi, X2, ..., X n be a sample random of size n drawn from a p-variate normal 
distribution N p (/j,, S). Then 



y/n^l{vec(R) - vec(P)}^N p 2(0,T), 



where 



a: T = 2M p $M p with M p = \{I p 2 + K pp ), 

b: K pp is the commutation matrix of size (p 2 xp 2 ), 

c: $ = {Ip2 - (j p P)A P HP(g) P}{I p2 - (I p <g> P) A p }, 

d: A p = Y]^—i ( e.% e* (^) e» e* ), e, is the i — th column vector of identity matrix 
Ip of size p x p. 

From this result, if the transformation is used on R, by using the result in 
Muirhead (1982, p. 5) we have: 



Proposition 3.1. 



Vn - \{v{R L ) - v(P L )}^N p 2(0, A), 



where k = ^^ and A = 2L p M p $M p L p 
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Proof. v(Rl) — LpVec(R). Muirhead (1982) said if C is a real matrix with size pxq, 
q<p, rank(C) = q and vec(R) ~ N p 2(vec(P), ^), then Cvec(R) N p (Cvec(P), C^C 1 ). 
So, if C = L p then we can get 

r 



v{R L ) = L p vec(R) ~ N k (L p vec(P),L p -L*) 



n-1 



where 
and 



£(v(fli)) =)X pV ec(P) = v(P L ) 



var(v(R(L))) — var(L p vec(R)) 
L r I* 



i . ... 1 

n - ' ■ ■ ' Jp 



— L p (2M p $M p )L; 
2 



- T L p M p $M p Li. 

n — 1 ^ 

Therefore, we can conclude that v(Rl) be have distribution multivariate normal 
with p-variate with mean vector v(Pl) and variance covariance matrix -^—^L p M p QM p L p . 

□ 

According to Corollary 3.2 and Proposition 3.3 in Djauhari (2007), if we de- 
fine u(v(Rl)) — \\v(Rl)\\ 2 , then we arrive at the following proposition about the 
asymptotic distribution of sample VVSV without duplication. 

Proposition 3.2. 



V^1\\v(Rl)\\ 2 - \\v(P L )\\UN(0,a 2 ), 
where 



a 2 = 4(v(P L ))*L p r(i p )*(t;(P L )) and T = 2M p $M p with M p - *(J p2 + K pp ). 

Proof. Based on Proposition 3.1, 

v(R L ) = L p vec{R) ~ N k (v(P L ), —^), 

where fc = ^^ and A = 2L p M p <$>M p L p . Now, 

|KP L )|| 2 = ( W (P L ))*«(P L ) 
= «(v(i2i)) 
because v(Rl) is a vector. D 

Theorem 3.3. If sequence {X n }— ► c , c be a vector constant in R p and {X n }^ N p ( c , S) 
t/ien random variable Y n — u(X n )^ N (ii y , o-y) , where \x Y -* u( c ) arad <jy — > 
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Y n = u{X n ) = «C?) + ( 9U @) 4 (^ - "?) + % 



If X„ -> ~? then quadratic form i?£ — > faster than linear form ( u i^' ) t (X n — 

c ). Y n and V n = u( c) + ( — I ' ) t (X n — c ) convergence in distribution to same 

distribution. Furthermore, X n -» 7V p ( c , S) then Serfling (1980, p. 24) written V n 
± N{^ vl a 2 v ) with /^ = E{V n ) = £(«(■?) + (^gi)*(X^ - "?)) convergence in 

probability to E(u( c )) — u( c) . This case is caused by X n — ► c which means 
E(( d &) t (xZ-^))^'0. So: 

OJin 



o- v = uar(V„ 



,du(~?) sii 



var(u{c) + (^^Y{X n - "?))) 
(^)W(^)( 9u( ^ 






Let 



«(v(i2L)) - ||«(i?L)|| 

= («( J R L )) 4 («( J R L )) 

= (L p vec{R))\L p vec(R)) 

and vec(R)^vec(P), see El Maachee (1997), so that L p vec(R)^ L p vec{P) . In other 
words, v(Rl)^v(Pl)- Therefore, 



Y n = u(v{R L )) 

dv(R L 



«P L )) + ( d ^W L ^ y(v(RL) - v(P L )) + % 



where i? € = §(v(fli) - w(P L ))M ? («(i? L ) - v(P L )). So, 

F„ = u(v(R L )) 

= («(iJi)*(t;(iJi) 

= (LpVeciRj)* (L p vec(R)) 



N(n Y ,a 2 



Y)i 



where 



My ^ u(v(P L )) 
= \HPl)\\ 2 
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and 

2 p 1 du{v{R L )) t t du(v(R L )) 

° Y - n (( dv{R L ) )V^ P (( dv{ R L ) ) 

£ i(2( V (P L )))*L p AL*(2( V (P L ))),t;(72 L ) = „(P L ) 

£ 4 («(P L ))%AL>(P L )) 

£ -(£ p vec(P))%AL'(L^ec(P)) 

^ -{LiLpVeciP^AiLiLpVeciP)) 

- -(vec(#))*A(vec(*)),vec(*) = L p L p vec{P). 
So it can be concluded that 

|| W (P L )|| 2 - ( W (P L ))*(«(P L )jiiV(|| w (P L )|| 2 , -( V ec(*))*A(z; e c(*))), 

— * n 

where vec('I') = L p L p vec(P) . □ 

This proposition is seemingly complicated to be used in application because the 
variance of sample VVSV without duplication ||u(Pl)|| 2 is A{v{PL)) t L p T{L p ) t . It 
is involves multiplication of large size matrix Y be (p 2 x p 2 ) . However, the future 
works to helps us; need to simplify the computation of that variance. 

4. Additional Remarks 

If vector variance vec(P) is of dimension p 2 , v{Pl) is of dimension k = p ^ ' . 
This gain is too good to be neglected. Furthermore, Propositions 3.1 and 3.2 have 
made possible the application of modified vector variance standardized variable. 
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Abstract 

Here we develop the Nabla Fractional Calculus on Time Scales. 
Then we produce related integral inequalities of types: Poincare, Sobolev, 
Opial, Ostrowski and Hilbert-Pachpatte. Finally we give inequalities 
applications on the time scales M., Z. 

2000 AMS Subject Classification : 26D15, 26A33, 39A12, 93C70. 
Keywords and phrases: Fractional Calculus on time scales, Nabla Poincare 
inequality, Nabla Sobolev inequality, Nabla Opial inequalities, Nabla Os- 
trowski inequality, Nabla Hilbert-Pachpatte inequality, fractional inequali- 
ties. 

1 Background and Foundation Results 

For the basics on time scales we follow [1], [2], [3], [4], [9], [11], [13], [6], [7], 

[10]- 

By [15], p. 256, for fj,, v > we have that 

x {x - sf- 1 (s - ty- \ _ { X - 1)^- 1 
, r(/i) i» r (/, + !/) ' u 

i 
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where T is the gamma function. 

Here we consider time scales T such that T k = T. 

Consider the coordinatewise Id-continuous functions h a : T x T — »■ R, 
a > 0, such that /i (£, s ) = 1> 

h a +i (t, s) = h a (r, s) Vr, (2) 

V s,teT. 

Here p is the backward jump operator and v (t) — t — p(t) . 
Furthermore for a, (3 > 1 we assume that 

t ^ ^ ^ 

h a -i (t, P (t))%-i (t, p («)) Vr = h a+ p-i (t, p (u)) , (3) 

p(u) 



(t-s) 



k 



valid for all u, t G T : w < £. 

In the case of T = R; then p (t) = t, and hk (t, s) — ^~^ , fc G N 
NU {0}, and define 

1 (a + 1) 
Notice that 



r(a + l) r(a + 2) 

fulfilling (2). 

Furthermore we observe that (a,(3 > 1) 

h a -i (t, t) hp-! (r, u) dr = / — — — — dr 

(by (1)) (t ~ U) - 

fulfilling (3). 

By Theorem 2.2 of [14], we have for k, m G No that 

t ^ ^ ^ 

h k (t, p (t)) h m (r, t ) Vr = h k+m+1 (t, t ) . (4) 

to 
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Let T = Z, then p (t) =t-l,t € Z. Define t° : = 1, r := t(t + 1) ... (t + k - 1 
fceN, and by (2) we have h k (i, s) = ^=j^-, s,teZ 7 ke N . 

Here l/ Vr = E! 0+ i • 

Therefore by (4) we get 

« ( t _ r + lf( r _ to )^_( t _ to )^^ 



fc! m! (A; + m + 1)! 

T=t + 1 V ' 



which results into 



v (t - r + l)*- 1 (r - tp + l)"" 1 (t - t + 1) 

^ (k-l)\ (m-1)! (ife + m-1)! ' U 

confirming (3). 

Next we follow [5]. 

Let a, a e R, define t" = ^±^2, t e R— {..., -2, -1,0}, 7V a = {a,a±l,a± 

2, ...}, notice iV = Z, 0° = 0, t° = 1, and / : N a -»■ R. Here p(s) = s-l, 
cr (s) — s + 1, v (t) = 1. Also define 

and in general 



v/w = t^/M 



where i/ 6 R -{..., -2, -1, 0}. 
Here we set 



\ (i 



(t - s) c 

MM) = r \ J , a>0. 
1 a + 1 



We need 



Lemma 1 Let a > — 1, x > a + 1. TTien 

r(x) i /r(x + i) r(x) 



T (x — a) (a + 1) \T (x — a) T (x — a — 1) 
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Proposition 2 Let a > — 1. It holds 



t /_ „\« /. „\a+l 



(r-,)« ft-.)' 



r(a + l) r(a + 2)' 

T7ia£ is h a , a > 0, on N a confirm (2). 

Next for p, z/ > 1, r < £, from the proof of Theorem 2.1 ([5]) we get that 
(t - p ( S ))^ ( S - p (r))^ 1 (t - p (r))^ 1 



£ 



r» r(p) ~ r(ii + v) 



where r G {a, ...,t}. 

So for £, to G iV a with t < t we obtain 



^ (t-r+1)" ^r-tp + lf (t-tp + 1) 

that is confirming (3) fractionally on the time scale T = N a . 
Notice also here that 



[ b f(t)vt= J2 f(t) 



t=a+l 

So fractional conditions (2) and (3) are very natural and common on time 
scales. 

For a > 1 we define the time scale V-Riemann-Liouville type fractional 
integral (a, b G T) 

Jaf(t)= fha-l(t,p(T))f(T)VT, (7) 

J a 

(by [8] the last integral is on (a, t] fl T) 

J°af (t) =f(t), 

where / G L\ ([a, b] fl T) (Lebesgue V-integrable functions on [a, b] fl T, see 

[6], [7], [io]),te[a,6]nr. 

Notice J\j (t) = f f (r) Vr is absolutely continuous in t G [a, b] fl T, see 

[8]. 
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Lemma 3 Let a > 1, f G Li([a,b]f]T). Assume that h a -\ (s, p (t)) is 
Lebesgue V -measurable on ([a, b] (IT) ; a,b G T . Then J£f G L\ ([a, b] fl T) . 

For u <t; u,t <E T, we define 



£ (t, u) = / /i a _i (t, p (r)) Vi (t, P («)) Vr 

= 1/ («) /i a _i (i, p («)) ^i («, p («)) , (8) 

where a,(3 > 1. 

Next we notice for a, (3 > 1; a, 6 G T, / G la ([o, 6] fl T), and /i a _i (s, p (t)) 
is continuous on ([a, b] fl T) for any a > 1, that 

■£■#/ (*) = / Vi (*>PW) Vt fVi (r,p(«)) / («) Vu. 

J a J a 

Hence 

Wif (t) + f f («) £ (*, «) V« = J a a+/ 7 (t) , V t e [a, b] n T. 

So we have the semigroup property 

■£■#/(*) + I /(tt)«/(ti)Li(t,p(«))Vi(«./'(«))Vtt = ^/(t), 

</ a 

(9) 
V t G [a, 6] n T, with a, 6 G T. 
We call the Lebesgue V- integral 

L> (/, a, P,T,t)= f (u) v (u) h a -! (t, p («)) Vi («, P («)) V«, (10) 
£ G [a, 6] flT; a, 6 G T, the backward graininess deviation functional of 

/eLi([M]nr). 

If T = R, then L> (/, a, /3, R, t) = 0. 
Putting things together we have 

Theorem 4 LetT k = T,a,beT,f G L x ([a, 6] n T); a,/3 > 1; /i a _i (s,p(i)) 

is continuous on ([a, 6] fl T) /or any a > 1. Then 

J?Jif it) + D (/, a, /?, T, t) = J^/ (t) , (11) 

V t G [a, b] n T. 
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We make 

Remark 5 Let \i > 2 such that m — 1 < p < m G N, i.e. m = |~p] (ceiling 
of the number), v = m — p (0 <u < 1). 

Let f G CJ2 ([a, b]C\T). Clearly here ([10]) / ym is a Lebesgue V -integrable 
function. 

We define the nabla fractional derivative on time scale T of order p — 1 
as follows: 

V£"7 (t) = (r a +1 f Vm ) (t) = f% (t, p (r)) r (r) Vr, (12) 

V t e [a, b] n T. 

Notice here that V^T 1 / G C ([a, b] D T) by a simple argument using dom- 
inated convergence theorem in Lebesgue V '-sense. 

If p — m, then v — and by (12) we get 



f a) = Jir {t) = r-\t) . as) 



7 m-l t u\ __ Tl -fV 

a* 



More generally, by [8], given that / v '™ is everywhere finite and ab- 
solutely continuous on [a,b] fl T, then / ym exists V-a.e. and is Lebesgue 
V -integrable on (a, t] fl T, V t G [a, b] fl T, and one can plug it into (12). 

We have 

Theorem 6 Let p > 2, m-1 < /i < m 6 N, v = m-p; f G C z ™([a,o] (IT), 
a,b E T, T k = T. Suppose /i M _ 2 (s,p(t)), h„(s,p(t)) to be continuous on 

([a,o]nT) 2 . 
Then 

fh m -i{t, P {r))f m {T)Vr= (14) 

J a 

4 / vm («) i/ («) Va (t, p («)) fe («, p («)) Vn+ f V 2 (£, p (r)) V^ 1 / (r) Vr, 

a J a 

V i G [a, b] n T. 

We need the nabla time scales Taylor formula 



6 
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Theorem 7 ([2]) Let f G C\ n d (T),mE N, T k = T; a,beT. Then 

m-l „t 

/ (*) = E ** ft °) /V " ( a ) + / ^»-i (*' -° ( r )) /Vm ( r ) Vr ' ( 15 ) 

V t e [a, b] n T. 

Next we present the fractional time scales nabla Taylor formula 

Theorem 8 Let \x > 2, m - 1 < p < m G N, z/ = m - p; f G C$ (T) , 
a,b <E T, Tk = T. Suppose h^-2 (s,p(t)), h„(s,p(t)) to be continuous on 

([a, 6] nT) 2 . Then 



m—l 



f(t) = J2hk(t,a)f k (a)+ (16) 

fc=0 

/ vm («) i/ («) £„_., (t, p («)) fe («, p («)) Vu+ / V2 (*, P (r)) VST 1 / (r) Vr, 

V t e [a, 6] n T. 

Corollary 9 ^4.ZZ as m Theorem 8. Additionally suppose f (a) = 0, k = 
0, 1, ...,m — 1. T/ien 

A(t):=f(t)-D(f m ,f,-l,D+l,T,t) (17) 

= /(*)"/ / Vm («) " («) V* («, P («)) ^ («, P («)) V« 

V £ G [a, 6] n T. 

Notice here that D (/ ym , p - 1,£ + 1, T, i) G C H ([a, 6] n T). Also the 
R.H.S (17) is a continuous function in t G [a, 6] fl T. 
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2 Fractional Nabla Inequalities on Time Scales 

We present a Poincare type related inequality. 

Theorem 10 Let p, > 2, m - 1 < p < m E N, v = m - p; / E C% (T), 
a,b G T , a < b, T k = T . Suppose h^ 2 (s, p (t)) , h„ (s, p (t)) to be continuous 
on ([a,b]nT) 2 , and f k (a) = 0, k = 0,l,...,m- 1. Here A(t) = f (t) - 
D(f vm ,p-l,v + l,T,t), tE [a,b}DT; andletp,q> 1 : \ + \ = 1. 
Then 



b I r b 

q 



\A(t)\ q Vt< 



a \J a 



K-2 (t, P (r)) 



Vr Vt 



v^vwrvt 



Next we give a related Sobolev inequality. 
Theorem 11 Here all as in Theorem 10. Let r > 1 and denote 



l/(*)l r vt 



Then 



(18) 



(19) 



\A\\ r < 



a \J a 



V-2(*,P(t)) 



Vr Vt IIV^T 1 / 



(6 -a) 



a \J a 



(20) 



Next we give an Opial type related inequality. 

Theorem 12 Here all as in Theorem 10. Additionally assume that 

\^a* l f\ is increasing on [a, b] T. 

Then 

[ |A(t)||V£-7(*)|Vt< 

J a 

V2 (*, P (r)) " Vr) Vt) " f /* (V^ 1 / (t)) 29 Vt 



(21) 



(22) 
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It follows related Ostrowski type inequalities. 

Theorem 13 Let p, > 2, m - 1 < \x < m G N, v = m - p,; f G C$ (T), 
a,b E T , a < b, Tk = T . Suppose h^ 2 (s, p (t)) , h„ (s, p (£)) to 6e continuous 
on {[a,b]nT) 2 , and f yk (a) = 0, jfe = l,...,m- 1. denote A(i) = / (t) - 
D(f vm ,fi-l,i/ + l,T,t), te [a,b]f]T. 
Then 

' f A(t)Vt-f(a) 

J a 



b — a 



a \J a 



b — a 



hp- 2 (t,p(T)) 



< 



Vr ) Vt) IIV^-VII r Hrvr - 

■' I M a * J II oo,[a,ojnT 



(23) 



Theorem 14 All as in Theorem 13. Let p,q>l :- + - = !. Then 



< 



1 



b — a 



a \«/ a 



— f A(t)Vt-f(a) 

h,-At,p(r)) P Vr)\t)\\V^f\\ qMnT . 



(24) 



We finish general fractional nabla time scales inequalities with a related 
Hilbert-Pachpatte type inequality. 

Theorem 15 Let e > 0, p > 2, m — 1 < p < m G N, v = m — p; 

fi G Cl2(Ti), ai,bi G Ti, ai < h, T ik = Ti time scale, i = 1,2. Sup- 



;w 



:w 



pose hy_ 2 (si, Pi(U)) , h~ (si, Pi (ti)) to be continuous on ([a», &*] Pi Ti) , and 
fj k {a % ) = 0, k = 0, 1, .., m-l-i = 1, 2. #ere A fe) = /, (^-^(/v^-i, 

v + l,Tj,ti), tj G [oi,6j] HTj; i = 1,2, andp,q > 1 : ± + - q = 1. 

^i) = rffe(*i,/^(ri))|yVTi, 
/or a// ti G [01,61], and 

G(t 2 )= I i h\:LAt,.P-,{To)) )'Vr, 



"1 



/2 



hSLtfatofo)) 
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for all t 2 G [02,62] (where h^_ 2 , Pi are the corresponding h^-2, p to T i: 
i = l,2). 
ThsTi 



1 1 1 ^(*i) 1 G(t 2 ) 



(ft! - Oi) (62 - Oa) (7 ' IVJJ" 1 /! (tl)T V*l) ' (J 2 IV^/2 (t 2 )T Vt 2 N " 

(25) 
(above double time scales Riemann nabla integration is considered in the nat- 
ural interative way). 



3 Applications 

I) Here T = R case. 

Let p > 2 such that m — 1 < /i < m G N, u = m — p, / G C m ([a, 6]), 

The nabla fractional derivative on K of order p — 1 is defined as follows: 

v£r7 (0 = ( j: +1 / m ) («) = YW+i) f a {t ~ r) " /M (r) dr ' (26) 

V t G [a, 6] . 

Notice that V^T 1 / 6 C ([o, 6]), and A (t) = f (t), V * G [a, 6] . 
We give a Poincare type inequality. 

Theorem 16 Let p > 2, m - 1 < p < m G N, / G C m (R), a, b G R, a < 6. 
Suppose f^ (a) = 0, fc = 0, 1, ...,m — 1. Let p,q>l:- + - = l. Then 



(b-ay 

(r(p-i)y(p-i) q ((p-2)p + iy- 



\f(t)\ q dt<— _ Q/ V J„ , -^ ( / \V^f(t)\ q dt 



a 



(27) 



Proof. By Theorem 10. ■ 

We give a Sobolev type inequality. 
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Theorem 17 All as in Theorem 16. Let r > 1. Then 



< 



(1 NM-2+i + i 

(o — a) p r 



r(n-l)((n-2)p+iy P ((n-2)r+ r - + l 



rllV^VIL (28) 



Proof. By Theorem 11. ■ 

We continue with an Opial type inequality. 

Theorem 18 All as in Theorem 16. Assume \V^ 1 f\ is increasing on [a,b]. 



[ 1/(01 |v£:7 (0| dt< 

J a 



(b-aY 



i_l ,/,x\2g 



r( / ,-l)[(( / i-2)p+l)((/i-2)p + 2)]p \Ja 

Proof. By Theorem 12. ■ 

Some Ostrowski type inequalities follow. 



(V£-7(*)r<ft • (29) 



Theorem 19 Let \i > 2, m - 1 < // < m E N, f E C m (R), a, 6 G R, a < b. 
Suppose f^ (a) = 0, k = 1, ...,m — 1. T/ien 



b — a 
Proof. By Theorem 13. 



f(t)dt-f(a) 



< (6-«r' ||v ,- 1 ,, 

~ r(/l + l) H a * ^ lloo,[a,6] ■ 



(30) 



Theorem 20 Here all as in Theorem 19. Let p, q>l :- + - = !. Then 



b — a 



f(t)dt-f(a) 



< 



(b-a) 






r(|i-i)( A *-i)(( A *-2)p + i)5 



- II V -1 /"I 

la* •> 



!o,[a,6] 

(31) 



Proof. By Theorem 14. ■ 

We finish this subsection with a Hilbert-Pachpatte inequality on 
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Theorem 21 Let e > 0, \x > 2, m - 1 < // < m G N, i = 1, 2; £ G C m 

en, h eR, ai < h, ff ] (a t ) = 0, k = 0, 1, ..., m - 1; p, q > 1 : \ + \ = 1. 
C7aW 

pit \ = Wi ~ ai > 

1 } (r(/i-i)) p ((/i-2)p + i)' 

ti G [ai, 6i], and 

rrM _ (* 2 - a 2 ) ( "- 2) * +1 

l2j (r(/i-l)r((/i-2)g + l)' 

^2 g [02,62]- 

TflGTl 

"" IA('0IIA(«»)l^ I(Bl < 



; 1 1 F(tl) 1 GH2) 

V P 9 

(6x - fll ) (62 - 02) (J ' |V£;7i (*i)rd*i) q (J 2 lv^/2 (t 2 )\ p dt^ P . 

(32) 

Proof. By Theorem 15. ■ 

II) Here T = Z case. 

Let fj, > 2 such that m — 1 < /x < m G N, u = m — fj,, a,b <E Z, a < b. 

Here / : Z - R, and / vm (t) = V™/ (t) = Eto (-1)" (™) / (* " *) ■ 

The nabla fractional derivative on Z of order // — 1 is defined as follows: 

t 



V£r7 (t) = (r a +1 (v"7)) (t) = f7 ^— v E (* - r + l f ^ m f) ( r ) » 

^ ' r=a+l 

(33) 

V t G [a, 00) n Z. 

Notice here that v (t) — 1, V t G Z, and 

A(*) = /(*)-D(V"7,/z-l,£+l,Z,f) 

= /(*)- E ( yTn / («)) ^T^Jr" ' (34) 

V t G [a, 00) n z. 

We give a discrete fractional Poincare type inequality. 

12 
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Theorem 22 Let \x > 2, m - 1 < /i < m E N, a, b E Z, a < b, f : Z - 

Assume V k f (a) = 0, k = 0, 1, ..., m - 1. Letp,q>l: J + J = 1. T/ien 



E i^wr < 

t=a+l 

cro^i* ( E ( E <«— dH) ( E ivsrv Ml 

v va* yy \t=o+l \T=o+l / / \t=a+l 

Proof. By Theorem 10. ■ 

We continue with a discrete fractional Sobolev type inequality. 

Theorem 23 Here all as in Theorem 22. Let r > 1 and denote 



(35) 



E i/(*)r 



vt=a+l 

TTien 



s ^l^lS, (, - r+1, ^JJ |vcl ^ <M 

Proof. By Theorem 11. ■ 

Next we give a discrete fractional Opial type inequality. 

Theorem 24 f/ere a// as in Theorem 22. Assume that | V^" 1 /) is increasing 

on [a, b] fl Z. TTien 



E I^(*)I|V£-7(*)|< 



t=a+l 



f^4 f E ( E <«-' + D^) ' ( E (vt-'/M) 2 '' * 

^ ' \t=o+l \r=a+l / / \t=a+l 



(37) 
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Proof. By Theorem 12. ■ 

It follows related discrete fractional Ostrowski type inequalities. 

Theorem 25 Let /i>2, m - 1 < fx < m <E N, a, b E Z, a < b, / : Z 
Assume \7 k f (a) = 0, k = 1, ..., m - 1. 
Then 



r^E^W-/' 



6 — a 



t=a+l 



< 



(6 



^^(i^- T+i) i) i|v " i/L >- (38) 



Proof. By Theorem 13. 



Theorem 26 v4ZZ as in Theorem 25. Let p,q>l :- + - = !. Then 



±- a ±A(t)-f< 



b — a 



t=a+l 



< 



- a) r (u -J t(t<t-T + 



(6-a)T(/i-l) 



I a * ■> \\q,[a,b]nZ 



i=a+l \T=a+l 



(39) 



Proof. By Theorem 14. ■ 

We finish article with a discrete fractional Hilbert-Pachpatte type inequal- 
ity. 

Theorem 27 Let e > 0, /i > 2, m - 1 < /i < m G N; i = 1,2; /» : 

Z — > R, aj,6j G Z, Oj < 6j. Suppose V fc /i ( a «) = 0, k = 0, l,...,ra — 1. 



/j-2 



ifere A ft) = /« ft) - £* =fll+1 (V m / («*)) ( ^^ " > V ** G k °°) n Z / 
CoZZ 



ri=ai+l 



(r Qi-i)) 



p ' 



14 
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V ti G [oi, oo) Pi Z, and 



!i, (+„_ To _LnO'-2)« 






? ' 



Vt 2 6 [a 2 ,oo)nZ. 
Then 



(T(a-l)) 

r 2 =a 2 +l V VA " 



^ , ^ , (p -I- *M 4. g&T 

tl=ai + l t2=02+l I fc T" _ T 

1 

6l \ q / 62 \ P 

IP 



(61-00(62-02) £ l v sr.Vi(«i)r E |v£".72(*2) 



v ti=ai+l / \t2=a2+l 



(40) 



Proof. By Theorem 15. ■ 

We intend to publish the complete article with full proofs elsewhere. 
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NUMERICAL SOLUTIONS OF NONLINEAR SECOND-ORDER 

TWO-POINT BOUNDARY VALUE PROBLEMS USING 

HALF-SWEEP SOR WITH NEWTON METHOD 

J. SULAIMAN, M.K. HASAN, M. OTHMAN, AND S.A. ABDUL KARIM 



Abstract. In this paper, we examine the performance of Half-Sweep Succes- 
sive Over-Relaxation (HSSOR) iterative method together with Newton scheme 
namely Ncwton-HSSOR in solving the nonlinear systems generated from second- 
order finite difference discretization of the nonlinear second-order two-point 
boundary value problems. As well known that to linearize nonlinear systems, 
the Newton scheme has been used to transform the nonlinear system into the 
form of linear system. Then the basic formulation and implementation of 
Newton-HSSOR iterative method are also presented. Numerical results for 
three test examples have demonstrated the performance of Newton-HSSOR 
method compared to other existing SOR methods. 



1. INTRODUCTION 

Boundary value problems occur in many branches of applied mathematics and 
physics such as gas dynamics, quantum mechanics, fluid dynamics, aerodynamics, 
chemical reactions, atomic structures, atomic calculations etc. Most phenomena in 
these problems, however, have been modeled by nonlinear differential equations. As 
a result, they have been studied extensively in recent years in order to find analytic 
and/or approximate solutions. For instance, there are many methods available for 
these problems such as numerical analytic, finite difference, finite element, finite 
volume and boundary element methods. Solving nonlinear boundary value prob- 
lems is very difficult in that only few of them can be solved explicitly. Therefore, 
in this paper, the finite difference method will be considered to develop a reliable 
algorithm in solving nonlinear two-point boundary value problems. By considering 
the second-order nonlinear finite difference approximation equations, a nonlinear 
system can be generated and needs to be linearized through the Newton method in 
order to form the corresponding linear system. Since the characteristics of linear 
systems are large and sparse, iterative methods are the natural options for efficient 
solutions. 

For that reason, various iterative methods also have been studied to yield fast 
numerical solution of linear systems (see Young [22, 23, 24]; Hackbusch [4]; Saad 
[12]). Apart from those iterative methods, the discovery on the concept of the half- 
sweep iterative method has been inspired by Abdullah [f] via Explicit Decoupled 
Group (EDG) iterative method in solving two-dimensional Poisson equations. The 
main characteristic of this concept is that the half-sweep iterative method includes 
the reduction technique in order to reduce the computational complexity of linear 
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(1.1) -—5- = g(x, U,U') , xe[a,b] 
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systems generated from corresponding approximation equations. Following to this 
concept, further investigations have been extensively conducted in [2, 3, 7, 16, 17, 25] 
for demonstrating the capability of the half-sweep iteration. Apart from these 
one-stage iteration concepts, combinations between half-sweep iteration concept 
with two-stage iterative methods namely HSIADE [15], HSAM [9, 18] and HSGM 
[8, 10] have also been constructed and implemented for solving linear systems. 
They pointed out that their proposed two-stage iterative methods arc one of most 
efficient iterative methods in solving any system of linear equations. Due to the 
low computational complexity, this concept has been also used to derive a family of 
multigrid methods [11, 19, 20]. Consequently, Hassan et al. [5, 6] have established a 
family of FDTD methods using this concept in solving wave propagation problems. 
Meanwhile Saudi and Sulaiman [13, 14] applied to solve the robotic path planning. 
In this paper, however, we deal with the application of family of SOR iterative 
methods such as Full-Sweep SOR (FSSOR) and Half-Sweep SOR (HSSOR) by 
applying Newton scheme in solving nonlinear two-point boundary value problems. 
For simplicity, the combination of both proposed iterative methods with Newton 
scheme is identified as Newton-FSSOR and Newton-HSSOR methods respectively. 
To investigate the capability of Newton-FSSOR and HSSOR-Newton methods, let 
us consider a nonlinear second-order two-point boundary value problem defined as 

dx 2 
subject to the boundary conditions 

U(a) = 0o, U{b) = fa 

where /?o and j3\ are constants and g(x, U, U') is a nonlinear continuous function. 
For the sake of simplicity, we shall restrict our discussion onto uniform node points 
only as shown in Figures 1 and 2. Let assume the solution domain (1) can be 
uniformly divided into m = 2 p ,p > 2 subinterval in which its distance, Ax is 
defined in equation (1.2). 

b — a 
(1.2) Aa; = = h, n = ra - 1 

m 

Based on Figures 1 and 2, both figures show the finite grid networks to facilitate 
us for the implementation of proposed computational algorithms. For that reason, 
the implementation of the point iterative algorithms will be applied onto the interior 
solid node points of type • only until the iterative convergence fixed is achieved. 

2. SECOND-ORDER HALF-SWEEP FINITE DIFFERENCE 
APPROXIMATION 

Using the approach of second-order discretization scheme, the nonlinear finite 
difference approximation equation for problem (1.1) can be easily shown as 

(2.1) Ui-i - 2U t + U i+1 - h 2 g(xi, U u ^ ±i ^^ i ) = 

for i = 1,2,3, ...,n. Equation (2.1) denotes as the full-sweep nonlinear approx- 
imation equations. Now using the second-order half-sweep scheme, we get the 
half-sweep nonlinear finite difference approximation equations 

(2.2) C/ 4 _ 2 - 2U t + U i+2 - 4h 2 g(Xi, U u Ui+2 ~ Ui ~ 2 ) = 

Ah 
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Figure 1. shows the 
distribution of uniform 
node points for the full- 
sweep cases respectively 
at n = 7. 



■4- 



-0- 



t0- 



■4- 



— 0- 

i 



Figure 2. shows the 
distribution of uniform 
node points for the half- 
sweep cases respectively 
at n = 7. 



for i = 2,4,6, ...,n — 1 . Then let us define the nonlinear function,/ in general form 
based on equations (2.1) and (2.2) as 

Ui+2 — Ui-2 . 



(2.3) fi(Ui p , U2p,...,U(( n +i)/p)-i) = Ui-2-2Ui+U i+ 2-4h g{x il V i 



4h 



for i = lp, 2p, 3p, ..., ((n + l)/p) — 1 , where the value of p in equation (2.3), which 
equals to 1 and 2, indicates the full- and half-sweep cases respectively. Now we 
consider all interior node points in the solution domain (1.1) to be imposed over 
equation (2.3) in order to generate a nonlinear system 



JipWip ,u 2p 



V, 



w 



(2.4) 



((n+l)/p)-l> 
Jlp\ u lp > u 2p >"-> ty ((„+l)/ p )_lJ 



= 



f {{n+ i)/ P )-i{U[ k p\u^,...,ul k) 



((n+l)/p)-l> 







where, Jj\ ,i — lp, 2p, 3p 



((n + l)/p) — 1 indicate as the k th estimation for 
corresponding exact solutions. To solve the nonlinear system (2.4), the Newton 
method was used to linearize and transform the original nonlinear system into a 
linear system which can be rewritten in the matrix form as 



(2.5) 
where, 



J 



(ff«) Ah, = - fj (tf«) 



J U {k) = 



dfip 
dU lp 
dh P 
dU lp 


dfip 
dU 2p 

9f2p 

dU 2p 


3/((n + l)/p)-l 

dU lp 


9/((n + l)/p)-l 

dU 2p 



dfi 



01', 



OC, 



((n + l)/p)-l 

Oj2p 



((n + l)/p)-l 



9/((n + l)/p)-l 
9C, ((»+1)/P)-1 
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Ahj = [ A/iip, Ah 2p , Ah 3p , . . . , Ah(( n+1 y p - ) _ l ] 

From equations (2.5), the value of the vector, U^ needs to be calculated by 
solving the linear system. Then estimate solutions of U/ L , % = lp, 2p, 3p, ..., {{n + 
l)/p) — 1 can be determined by using the following expression 

(2.6) U\ k +1) = Vf } + Ahi, i = lp,2p,3p,...,((n+l)/p)-l 

3. FORMULATION OF FAMILY OF SOR METHODS 

As above-mentioned in the second section, the coefficient matrix, A of linear 
systems in equations (2.5) is sparse and large. Let the linear system in equations 
(2.5) be rewritten as 

(3.1) AU = F 

Consequently, iterative methods are proposed being as the natural options for ef- 
ficient solutions of sparse linear system. To solve linear system (3.1), we propose 
and construct FSSOR and HSSOR iterative methods being used as linear solver. 

Actually, Young [22, 23, 24] initiated Successive Over-Relaxation (SOR) method, 
which is one of the most known and widely used iterative techniques to solve any 
linear systems. To derive the formulation for FSSOR and HSSOR iterative methods, 
let the coefficient matrix, A in equation (3.1) be decomposed as 

(3.2) A = D + L + V 

where L, D and V are lower triangular, diagonal and upper triangular matrices 
respectively. Now using the decomposition in equations (3.2) and determining 
values of matrices D, L and V, therefore, the general scheme for FSSOR and HSSOR 
methods can be stated as [16, 22, 23, 24] 

(3.3) U {k+1) = (1 - uj)U {k) + u(D + L)- 1 (-VU {k) + f\ 

where ui and U^ k ' represent as a relaxation factor and an unknown vector at the k th 
iteration respectively. The choice of relaxation factor depends upon the properties 
of the coefficient matrix, A. In addition, a good choice of parameter can improve 
the convergence rate of iteration process. In practice, the optimal value of u> in 
range 1 < lu < 2 will be obtained by implementing several computer programs and 
then the best approximate value of lu is chosen in which its number of iterations is 
the smallest. 

Due to the advantage of half-sweep iteration concept applied onto the HSSOR 
method for reducing its computational complexity, we examine the efficiency of 
FSSOR and HSSOR iterative methods in solving equations (1.1). As taking ui = 1, 
SOR method reduces to Gauss-Seidel (GS) method. Later GS method will be used 
as control method. According to equations (3.3), the general algorithm for a family 
of SOR iterative methods to solve problem (3.1) would be generally described in 
Algorithm 1. 

Algorithm 1: FSSOR and HSSOR schemes 
i. Initialize 

f7(°)«-0, e^HT 10 
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ii. For i = lp, 2p, 3p, ..., ((n + l)/p) — 1 , calculate U i - k+1 from 

fj(k+i) ^(D- loL)- 1 ((loV - (1 - LuL))U (k) + lof] 



iii. Check the convergence test, 



U(k+1) _ U(k) 



< e. If yes, go to step (iv). Other- 



wise go back to step (ii) iv. Display approximate solutions. 

4. NUMERICAL EXPERIMENTS 

In order to validate the effectiveness of the FSSOR and HSSOR iterative meth- 
ods together with Newton approach namely Newton-FSSOR and Newton-HSSOR, 
three nonlinear example problems were tested. For the sake of comparison, three 
criteria will be considered for family of SOR methods such as number of iterations, 
execution time and maximum absolute error. In the implementation of the iterative 
methods, the convergence test considered the tolerance error, e = 10~ 10 . 

Example 1 [21] For comparison purpose, we consider the following nonlinear 
two-point boundary value problem 

Subject to the boundary conditions 

17(0) = 4, U(l) = 1. 
Then boundary conditions and the exact solution of the problem (4.1) were defined 

by 

(4.2) U(x) = —L- j x€[0,l]. 



Example 2 [21] Let consider the following problem 

dx 

with the boundary conditions 



(4.3) -^-j = U 3 - UU', »G[1,2]. 



U{1) = \, [7(2)4 
Then boundary conditions and the exact solution of the problem (4.3)were defined 

by 

(4.4) t/ (a; ) = _l_ 3 € [1,2]. 

Example 3 [21] The third nonlinear boundary value problem, we consider as 
follows 



(4.5) — = , a;g [1,3]. 



d 2 U 32 + 2x 3 - UU' 
Ox 
Subject to the boundary conditions 

U(l) = 17, C/(3) = y 
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Then boundary conditions and the exact solution of the problem (4.5) were defined 

by 

16 



(4.6) 



U{x) = x z + 



(,:) 



e[i,3]. 



For all above examples, results of numerical experiments obtained from imple- 
mentation of the FSGS, FSSOR and HSSOR iterative methods, have been recorded 
in Tables 1,2, and 3. 

Table 1. Comparison of number of iterations, execution time (in 
seconds) and maximum errors for the iterative methods with the 
optimal value of ui for example 1. 



Number of iterations 



Methods 



Mesh size 



512 



1024 



2048 



Newton-FSGS 
Newton-FSSOR 

Newton-HSSOR 



817596 

5890 
(w=1.98762) 

2984 
(w=1.9757) 



2853149 

11258 

(w=1.99363) 

5890 
(w=1.98761) 



9767783 

21706 
(w=1.99687) 

11258 
(w=1.99363) 



4096 



32773526 

41526 

(w=1.99838) 

21708 
(w=1.99687) 





Execution 


time 


(seconds) 






Methods 






Mesh size 








512 




1024 


2048 


4096 


Newton-FSGS 

Newton-FSSOR 

Newton-HSSOR 


10.24 
0.10 
0.03 




71.25 
0.33 
0.09 


487.01 
1.23 
0.33 


3266.51 
4.65 
1.27 


Maximum absolute errors 


Methods 






Mesh size 








512 




1024 


2048 


4096 


Newton-FSGS 

Newton-FSSOR 

Newton-HSSOR 


3.3005e-6 
1.8275e-6 
7.2994e-6 




7.1831e-6 
4.6509e-7 
1.8275e-6 


2.7356e-5 
1.2609e-7 
4.6509c-7 


1.0899c-4 
2.5021c-8 
1.2600c-7 



5. CONCLUSION 

In this paper we have examined the performance of FSGS, FSSOR and HSSOR 
iterative methods together with Newton scheme for the numerical solution of non- 
linear two point boundary value problem using second order finite difference approx- 
imation equations. To demonstrate the efficiency of the three proposed methods, 
three nonlinear examples are presented. Through the numerical results obtained in 
Tables 1, 2, and 3, Newton-HSSOR method is compared with the Newton-FSGS and 
Newton-FSSOR methods. As a result, it shows that the Newton-HSSOR method 
is superior in terms of the number of iterations and the execution time for different 
mesh sizes. This is because of the computational complexity of Newton-HSSOR 
method is approximately 50% compared to the full-sweep case. Overall, the ap- 
proximate solutions for all three proposed methods are in good agreement. For the 
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Table 2. Comparison of number of iterations, execution time (in 
seconds) and maximum errors for the iterative methods with the 
optimal value of u> for example 2. 



Number of iterations 



Methods 



Mesh size 



512 



1024 



2048 



4096 



Newton-FSGS 
Ncwton-FSSOR 

Newton-HSSOR 



611486 

4184 
(w=1.98827) 

2168 
(w=1.97554) 



2041033 

8109 
(w=1.99414) 

4184 
(w=1.98827) 



66755841 

15464 

(w=1.99694) 

8109 
(w=1.99414) 



21457588 

30298 
(w=1.99847) 

15464 
(w=1.99694) 





Execution 


time 


(seconds) 






Methods 






Mesh size 








512 




1024 


2048 


4096 


Ncwton-FSGS 

Newton-FSSOR 

Newton-HSSOR 


7.50 
0.07 
0.03 




49.70 
0.24 
0.07 


323.06 
0.86 
0.24 


2067.58 
3.31 
0.89 


Maximum absolute errors 


Methods 






Mesh size 








512 




1024 


2048 


4096 


Newton-FSGS 

Ncwton-FSSOR 

Newton-HSSOR 


2.5066e-6 
4.7539e-9 
2.2045c-8 




1.0016e-5 
1.1608e-8 
4.7539e-9 


3.9785e-4 
2.6041c-8 
1.1608c-8 


1.5742e-4 
4.8332e-8 
2.6041e-8 



future works, The method can be extended to solve multi-dimensional nonlinear 
problems. Apart from single step iterative methods, two-step iterative methods 
(Ruggiero & Galligan [26]; Evans & Sahimi [27]; Sahimi et al. [28]; Sulaiman et al. 
[15]) are also interesting to be examined for the solution of nonlinear problems. 
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A RELATED FIXED POINT THEOREM IN n INTUITIONISTIC 
FUZZY METRIC SPACES 

FAYQAL MERGHADI 

Abstract. We prove a related fixed point theorem for n mappings in n In- 
tuitionistic fuzzy metric spaces using an implicit relation which generalizes 
results of Aliouchc and Fisher [2], Merghadi and Aliouche [12] and Rao ct al. 
[15]. 



1. Introduction and Preliminaries 

The theory of fuzzy sets was introduced by Zadeh [20] in 1965. Since then, to use 
this concept in topology and analysis, many authors have expansively developed 
the theory of fuzzy sets and applications, for example, Deng [5], Ereeg [6], George 
and Veeramani [7] , Kramosil and Michalek [9] have introduced the concept of fuzzy 
metric spaces in different ways. One of the most important problems in fuzzy 
topology is to obtain an appropriate concept of intuitionistic fuzzy metric space. 
This notion has been introduced and studied by Park [13]. Alaca et al. [1] have 
redefined the concept of intuitionistic fuzzy metric spaces, according to concept of 
fuzzy metric spaces and proved Intuitionistic fuzzy Banach and Intuitionistic fuzzy 
Edelstein contraction theorems, with the different definition of Cauchy sequences 
and completeness. 

Recently, Merghadi and Aliouche [12] Aliouche and Fisher [2], Aliouche et.al 
[3] and Rao et.al [15] proved some related fixed point theorems in compact metric 
spaces and sequentially compact fuzzy metric spaces. Motivated by a work due to 
Popa [14], we have observed that proving fixed point theorems using an implicitly 
relation is a good idea since it covers several contractive conditions rather than one 
contractive condition. 

Definition 1.1. [17] .A binary operation * : [0, 1] x [0, 1] — ► [0, 1] is a continuous 
t—norm if it satisfies the following conditions: 

1. * is associative and commutative, 

2. * is continuous, 

3. a * 1 = a for all a G [0, 1] 

4- a *b < c* d whenever a < c and b < d, for each a, 6, c, d G [0, 1] . 

Two typical examples of a continuous t— norm are a*b — oh and a*b — min{a, b}. 

Definition 1.2. [17] .A binary operation <0 : [0,1] x [0,1] — ► [0,1] is a continuous 
t—conorm if it satisfies the following conditions: 
1. is associative and commutative, 



Key words and phrases. Fuzzy metric space; implicit relation: Intuitionistic fuzzy metric space; 
related fixed point. 
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2. is continuous, 

3. a()0 = a for all a € [0,1], 

4- a()b < c(><i whenever a < c and b < d, for each a, b, c, d G [0, 1] . 

Examples of a continuous t— conorm are a0& = max {a, b} and a()b = min{l, a + 
b}. 

The concept of intuitionistic fuzzy metric space is denned by Park [13]. 

Definition 1.3. A 5— tuple (X, M,Af, *,()) is called an intuitionistic fuzzy metric 
space if X is an arbitrary (non-empty) set, * is a continuous t-norm, <0 a contin- 
uous t-conorm and M,Af are fuzzy sets on X 2 x ]0, +oo[, satisfying the following 
conditions for each x, y, z G X and t, s > 0, 

1. M(x,y,t)+Af(x,y,t) < 1; 

2. M(x,y,t) > 0; 

3. M (x, y, t) = 1 if and only if x = y; 
4- M(x,y,t)=M(y,x,t); 

5. M (x, y, t)*M( y, z,s) < M (x, z, t + s); 

6. M (x, y, •) : ]0, +oo[ — ► [0, 1] is continuous; 

7. Af (x, y, t) = if and only if x — y; 

8. Af(x,y,t)=Af(x,y,t); 

9. N (x, y, t) 0JV (y, z,t)>Af (x, z, t + s); 

10. Af(x,y,t) : ]0, +oo[^ [0,1] is continuous. 

Then (M,Af) is called an intuitionistic fuzzy metric on X. The functions M (x, y, t), 
Af (x, y, t) denote the degree of nearness and the degree of non-nearness between x 
and y with respect to t, respectively. 

Every fuzzy metric space (X, M, *) is an intuitionistic fuzzy metric space of the 
form (X, M, 1 — M, *,<>}) such that t-norm * and t-conorm are associated [11], 
i.e. x()y = 1 — ((1 — x) * (1 — y)) for any x, y € X. 

Lemma 1.4. [13]/n intuitionistic fuzzy metric space X, M (x, y, •) is non- decreasing 
and Af (x, y, •) is non-increasing for all x, y G X . 

Example 1.5. Let (X, d) be a metric space. Denote a * b — ah and a(}b — 
min {1, a + b} for all a,b € [0, 1] and let Md and Afd be fuzzy sets on X 2 x ]0, +oo[ 
defined as follows: 

i \ ht n . r , n d{x,y) 

ht n + md(x,y)' " ' kt n +md(x,y) 

for all h, k,m,n G M + . Then (X, M,Af, *, 0) is an intuitionistic fuzzy metric space. 

Let (X,M,Af ,*,(}) be an intuitionistic fuzzy metric space. For t > 0, the open 
ball B(x, r, t) with center x G X and radius < r < 1 is defined by 

B{x,r,t) = {yEX: M(x,y,t) > 1 - r,Af(x,y,t) < r}. 

A subset A c X is called open if for each x G A, there exist t > and < r < 1 
such that B(x,r,t) C A. Let Tt M jj\ denote the family of all open subsets of X. 
Then T(m,N) i s called the topology on X induced by the intuitionistic fuzzy metric 
(M,Af). This topology is Hausdorff and first countable. 

Definition 1.6. [13] Let (X, M,Af, *, 0) be a fuzzy metric space. 

1) A sequence {x n } in X converges to x if and only if M(x n ,x,t) — > 1 and 

Af (x n , x, t) — > as n ^ co for each t > 0. 
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2) A sequence {x n } in X is called a Cauchy sequence if for any < e < 1 and 
t > 0, there exists n G N such that for all n > n , M(x n ,x m ,t) > 1 — e and 
Af (x n ,x m ,t) < e for each n,m> no-, M(x n , x, t) — > 1 asn^oo for all t > 0. 

3) The intuitionistic fuzzy metric space (X, M,Af, *,<£>) is said to oe complete if 
every Cauchy sequence is convergent. 

Definition 1.7. Lei (X, M, ,Af ,*,()) be a fuzzy metric space. M is said to be 
continuous on X 2 x (0, oo) if 

lim M(x n ,y n ,t n ) = M(x,y,t) 

n — >oo 

whenever {(x n ,y n ,t n )} is a sequence in X 2 x (0,oo) which converges to a point 
(x,y,t) e X 2 x (0, oo); i.e., 

lim M(x n ,x,t) — lim M(y n ,y,t) = 1 and lim M(x,y,t n ) — M(x,y,t). 

n — >oo n — »oo n — >oo 

Theorem 1.8. [13] Let (X, M, , Af, *, 0) be an intuitionistic fuzzy metric space such 
that every Cauchy sequence in X has a convergent subsequence. Then (X, M, , TV, *, 0) 
is complete. 

Proof. [13] D 

• Implicit Relation 

We denote by $, \& respectively, sets of all functions ip, tp : [0, l] 6 — ► [0, 1] such 
that 

(i) <f> S $, tp <G VP and </>, ^ are upper semi continuous in each coordinate variable, 

(ii) <f>, ip are decreasing in second and third variable, 

(hi) if either <f>{u, 1, u, v, v, 1) > or 4>(u, 1, u, v, 1, i>) > or (f>(u, u, 1, 1, v, v) > 
or for all u, v € [0, 1], then u > v. Furthermore, if either 

^(w, 0, w, v ,v , 0) < or ^(w, 0, u, v, 0, v) < or <j>(u, u, 0, 0, v, v) < for all 
u, v G [0, 1] , then u < v. 

Example 1.9. Let 4>(ti,t2,t 3 , £ 4 , t 5 , £ 6 ) = t\ — min{£ 2 , £3, t4,tz,,t & }. Then <f> £ $. 
Example 1.10. Let V(*ii ^2, £3, £4, £5, £e) = £1 — max{£2, £3, £4, £5, £g}. T/ien -0 € \P. 
Example 1.11. 

(p ({£ 2 , £3, £4, £5, £6}) = £1 - n(min{£ 2 ,£ 3 ,£4,£5,£6}) 

^({£2,£3,£4,£5,£6/) = £1 - V(max{£ 2 ,£3,£4,£5,£6}) 

where r\, ip : [0, 1] — > [0, 1] is a increasing and continuous function respectively, with 
77 (£) > £ and </3 (£) < £ /or < £ < 1. For example r\ (£) = \ft or n (£) = t h for 

< h < 1 and (£) = - . 

We need the following lemma of [10]. 

Lemma 1.12. Le£ {x n } be a sequence in intuitionistic fuzzy metric space (X, M, Af, *, {}) 
with M (x, y, t) — ► 1 and A/" (a;, y, £) — > as £ — > 00 /or all x,y € X. If there exists 
a number k €]0, 1[ smc/i £/ia£ 

M(x rl+ i,a;„,fc£) > M (x n , x„_i, £) , 

Af(x n+ i,x„,kt) < Af(x n ,x n -i,t). 

Then {x n } is a Cauchy sequence in X . 
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Lemma 1.13. [10]. Let (X, M, ,jV, *,()) be an intuitionistic fuzzy metric space. If 
there exists k G (0, 1) such that M(x, y, kt) > M(x, y, t) and N(x, y; ki) < N{x, y; t) 
for x, y G X , then x = y. 



Proof. See [10] 



□ 



2. Main Results 

Theorem 2.1. Let (Xi,Mi,Afi,0i,j i ) 1<i<) ben complete intuitionistic fuzzy met- 
ric spaces with Mj(x, Xi, t) — ► 1 and A/i (x, Xi, t) — ► as t — ► oo for all x, x% G Xi 
and let {Ai\ j Z 1 be n-mappings such that A{ : Xi — > Xj+i for all i = 1, .., n — 1 and 
A n : X n — > Xi, satisfying the inequalities, 

(2.1m) 

/ Mi (A n A n - 1 ..A 2 x 2 , A n A n _ 1 ..A 2 A 1 x l ,kt) , M x (xi, A n A n _ 1 ..A 2 x 2 ,t) ., 
<M Mi(xi,^ n A„_ 1 ..A 2 ^ixi,t),M 2 (x 2 ,Aixi,i), |,-() 

\ M 2 (x 2 , AiA„^„_i..A 2 x 2 , t) , M 2 (Aixi, A 1 A n ^„_ 1 ..A 2 x 2 , t) 

(2.1jv) 

/ A/i {A n A n _ x ..A 2 x 2 ,A n A n _ x ..A 2 A x x\,kt)M\ {xi,A n A n _ 1 ..A 2 x 2 ,t) . 
V>! A/i(a;i,^„A„_i..^ 2 Aixi,i),A r 2(x 2 ,Aixi,t), | <. 

\ A/" 2 (x 2 ,Ai^ nJ 4„_i..A 2 x 2 ,i) ,AA 2 (AiXi, A 1 A n A n - 1 ..A 2 x 2 ,t) 

for all xi € .Xi, x 2 G X 2 and t > 0, in general, we have 

(2.i M ) 

/ M, (^_iA l _ 2 ...Ai^„ J 4„_i..A i+ ia;i + i,A i _iA i _ 2 ...AiA„^ Il _i..A l x i ,fci), \ 

M i (xi,Ai_i J 4i_2...^4iA n A„_i.. J 4 i+ ia;i+i,*), 

Mi (x i ,A i _iA i _ 2 ...A 1 A n A n _ 1 ..A i Xi,t) ,M i+1 (x i+ i,AiXi,t) , 

M i+ i (xi + i, J 4i J 4j_ 1 ... J 4iA n A n _i..A i+ ix i+ i,t), 

M i+ i (A i x i ,A i A i -i...AiA n A n -i..A i+ ix i+1 ,t) 



\ 



>0 



(2-W) 



v>* 



/ A/* (Ai_ 1 A i _ 2 ...A 1 A n A n _ 1 ..A i+1 x i+ i,A i _ 1 A i _ 2 ...A 1 A n A n _ 1 ..A i x i ,kt) , \ 

A/i (x i ,A i - 1 Ai- 2 ...A 1 A n A n - 1 ..A i+1 x l+1 ,t) , 

A r i(x i , J 4i_iAj_ 2 ...A 1 ^4 nJ 4„_ 1 .. J 4 i Xi,i),A/'i + i(xi + i,AjXi,t), 

A/i+i (x,+i, A i A i -i...AiA n A n - 1 ..Ai + ix i+ i,t) , 

A/i+i (A i x il A l A l ^ 1 ...A 1 A n A n - 1 ..Ai +1 x i+ i,t) 



<0 



/or a// Xj G X^ Xi + i G -X,+i, £ > 0, where, ^ e $, ^, e $, i = 2, ...,n — 1 and 
< fc < 1. Again we have, 



(2.iim) 



/ M n (A n _iA n _ 2 ...A 1 xi,A n _ 1 A n _ 2 ...AiA n x n ,kt), \ 

M„ (ar n ,^4 n _iAi-2—-4ia;i,t) , 

M„ (x n , A n _\A n _ 2 ...AxA n x n , t) , Mi (xi, A n x„, i) , 

Mi (xi,A„A„_iA„_ 2 ...Aixi,£), 

V Mi(A l x„, J 4„A„_i,4„_ 2 ...AiXi,i) / 



>0 



(2.njv) 



^r. 



/ A4 (A n _iA n _ 2 ...A 1 xi,A n _iA n _ 2 ...AiA n x n ,kt) , \ 
A/"„ (x„, J 4 n _iA„_ 2 ... J 4ixi,t), 

A/ n (x„, j4„_iA rl _ 2 ...^4ij4„x„, t) , A/i (xi, A n x„, i) , 

M (a;i,Ai-4n-iAi-2—-4ia;i,t), 

\ Afi(A n x n ,A n A n -iA n - 2 ...Aixi,t) ) 



<0 
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for all xi £ X\, x n £ X n and t > 0, where <f> n £ $, ip n £ \I/ and < fe < 1. 

Further, suppose that {A 2 } is continuous on X; L Then, 

A i _ 1 A i _ 2 ..A 1 A n A n _ 1 ..A i 

has a unique fixed point pi £ Xi for i = 1, ...,n. Further, AiPi — Pi+\ for i — 
1, ...,n- 1 and 4p n =Pi- 



Proof. Let < xi > , i a;,- H , , \ xf- l ' \ , ...,<Xr > be sequences in Xi , 

I J r£N I > r£N I J r£N I J r£N 

X 2 , ...,Xi, ....X n respectively. Now let Xq be an arbitrary point in X\, we define 
the sequences < ij. > for i = 1, ..., n by 

x« = (A„A l -i...^i) r 4 1) - 
x^ = Aj_iAj_2--4ia;i 1) for i = 2,...,n. 
For n = 1, 2, ..., n we assume that x r 7^ xj,+i- Applying inequalities (2.1m) and 
(2.1a/-) for x 2 = j4iXr-i = x r-i' x i — ^ we get 

/ M 1 (xPKx™!, kt) , 1, Mi (x^, x^i,*) , 

^ ^ M 2 (^X^^X^,*) ,M 2 (^X^^X^,*) ,1 

M (4 1) ,4+i>^) ,0,M (aj^.^Si.t) , 

AA 2 (^i^L^ix^.tj ,AA 2 (Aufi, ^ix^,*) ,0 
From the implicit relation we have, 

(3.1m) Mr (a^.agi.ftt) > M 2 (xfl^xf\t) , 



V>i 



>0 



<0 



(3.1 



N) 



M (x« , xW, fct) < AA 2 (x^ , x^ , *) 



Applying inequalities (2.zm) and (2.ijv) for x,+i = Ai...A\ (A n ...A\ 
A^..A\x r _-y and Xj = Ai_i..Ai (A n ...A]) r Xq ' — Ai_\..A\Xr , we obtain 

' M i (^_ 1 ..A 1 x^^_i..A 1 x^ 1 ,to), N 



i*- 1 ^ 1 ) 







l,M i ^_i..i4ia£ 1) ,A i _i..i4ia£J 1 ,i 
,M i+ i fA,..A 1 x I ( , 1) 1 ,A 4 ..A 1 ^ 1) ,i 



>0 



1>i 



y M l+1 [A l ..A 1 x { rl 1 ,A l A l _ 1 ..A 1 x { r\Vj ,1 y 

M (x« , x^ , kt) , 0, M (x, (j) , sjji , *) 
,Af i+1 {x^!\xi i+1 \t) ,M +1 (x^U^,*) ,0 
From the implicit relation we obtain, 

(3.i M ) M t (x« , x^ , fct) > M i+1 (x^ , 4 4+1 ' , t) 

(3.ijv) Mi (xf , x% , kt) < M+i (x^ , x^ , i) 

for i — 2, ...,n — 1 and r £ N. 



<0 
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Now applying inequalities (2.tim) and (2.n^f) for x n — Xr and X\ = x r \ we 
have 



Mi (.W, x r (1) , tj , Mi (jW, x« , i) , 1 



>0 



yv n I Xr ,x r _|_^,/ctl ,U,yV n I Xr ,x r _|_^,tl , 

* nl M^^.tJ.M^.^tj.o l<0 

and so by implicit relation we have 

(3.n M ) M n (»W , 4+ } i , fct) > Mi (^-i , ^ . *) 

It now follows From (3.1m) , (3.«m) and (3, tim) that for large enough n 
MifoKx^kt) > M 2 (xfl 1 Mr\t) 

M^sW.^.t) > M i+1 ^ 1 1) ,4 i+1) .^) 

> ... 

> M It'"' r (n) — 



> Mi I x (1) a; (1) — 

> 



Mi I x {1) x {1) - 



> 



— ! rr+i-mn-liVi-nm)^„_i + i 

> rmn(M 1 f4 1 \4 1 \ 7 4rV"».^nfa : i n) ) 4 n) ,- ' 



£mn /''•■•' " 1 1 ' ^ ' fomn 

And it follows From (3.1/v) , (3.ijv) and (3, njv") that for large enough n 



M(4 .^i.t) < M + i(4-\M i+1) ^) 



< JVi [ x (1) ~ (1) * 

^ ... ^ ^vi i x r+i _ 2n _ 1 , 



r+i-2rn i.2n-*+l 
t 



< < AT, (x (l) x (1) 

— ■■■■ — • /v l I *r+i-mn-li •''r+i-mra' 



fcmn—i+l 



fcmn I ' '" " I ~i ' ^ ' £ 



Since < k < 1. It follows from Lemma 1.11 that < av f is a Cauchy sequences in 
Xi with a limit p, in JQ for i = 1, 2, .., n. 
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To prove that pi is a fixed point of Ai_\...A\A n ...AiPi, we can use proof by 
induction. Recently F. Merghadi and A. Aliouche sent an article about the existence 
of three fixed point in intuitionistic fuzzy metric spaces, which have been accepted 
for publication in the Journal of Mathematical and Computational Science. We have 
demonstrated for three spaces that Ai_\...A\A n ...AiPi — pi and AiPi — j>j+ifor 
i = 1,2,3. 

By induction, Supposing that Aj L _\...A\A n ...A i pj L — pi and AiPi — Pi+\ it true for 
all Ai e Xi and i = 1, 2, ..., n— 1. Now, we will going proved that Ai_\...A\A n ...AiPi 
— pi for all Ai € Xi and i = 1,2, ..., n 

Using the inequality (2Am) and (2.ijj) for Xi — Pi and cCj+i = » 
we obtain 



t'+ 1 1) =A l ...A 1 (A n ...A i y- 1 x { 1) 



i>i 



' Mi (x i ?\A i _ 1 ..A 1 A n ...A i p h kt),M i (p h x^\t) , N 

Mi (pi, A i _ 1 A i _2—AiA n A n _i..A i p i ,t) , 

M i+1 (x^ , A iPi> t) , M <+1 (a^j? , x ( ; +1) , t) , 

M i+1 (AiPi,x { j +1 \t) 



1 Mi (x < ?\A i _ 1 ..A 1 A n ...A i p i ,kt) ,Afi ( K ,xf l) ,t) , N 
Mi (p i ,A i _ 1 Ai_2...A 1 A n A n _ 1 ..A i pi,t) , 
M i+1 ft'+p , ^ , t) , M+i (4- + i 1} , * { r +1) , t 

M i+ i (AiPi,xl l+1} ,t) I 

Letting r — > oo we have 

Mi ( Pi , A i - 1 ..A 1 A n ...A i p i , hi) , 1, 
M i (p i ,A i _iA i _ 2 ...Ai^ n A n _i..^ i p i ,t), 1 M) 
M l+1 (p i+1 ,AiPi,t) ,l,M i+ i (AiPi,p l+1 ,t) 

Mi (pi, A i _ 1 ..A 1 A n A n _ 1 ...A i p i , kt) , 0, 
r ; \ Mi{p i ,A i _ 1 A i _ 2 -A 1 A n A n _ 1 ..A i p i ,t), | < 
M i+ i (p i+1 ,A l p i ,t) ,0,M i+ i (AiPi,p i+1 ,t) 

It follows from (iii) that 

(4.i M ) M % ( Pi ,A i _ 1 ..A 1 A n ...A iPi ,kt) > Mi+i (p i+ i, A iPi ,t) 



>0 



<0 



(4.i w ) A/* (p», A l ^i..A 1 A n ...A i p il kt) < M i+ i (p,+i, A i p l ,t) . 

for i = 2, ...,n — 1. By the same way we put x\ — p\, x 2 = a>-i m (2-1m) ((2.1/v - )) 
and i„ = p n , Xi = x r _ x in (2.71m) ((2.71^/")) respectively, we get 



M 1 { Pu A n ...A lPu kt) > M 2 ( P2 ,A lPl ,t) 
M n (p n ,A n _i..A 1 A n p n ,kt) > M 1 (pi,A n p n ,t) 



(4.1m) 
(4.iim) 



Mi(p!,A n ...A lPl ,kt) < M 2 (p2,A lPl ,t) 
M n {p n ,A n _ 1 ..A 1 A n p n ,kt) < M 1 (p 1 ,A n p n ,t) 



(4.1jv) 

(4.n w ) 
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We can write again from (4.1m) , (4.1m) and (4.tim), 

M 1 (p 1 ,A n ..A lPl ,kt) > M 2 (p 2 ,A lPl ,t) 

M 2 (p 2 ,A 1 A n ..A 2 p 2 ,kt) > M 3 (p 3 , A 2P2 ,t) 

M 3 (p 3 , A 2 A 1 A n ..A 3 p 3 , kt) > M 4 (p 4 , A 3P3 , t) 

M 4 (p 4 , A 3 A 2 A x A n ..A A p A , kt) > M 5 (p 5 , A 4 p 4 , t) 

M i (p i ,A i _ 1 ..AiA n ..A i p i ,kt) > M i+1 (p i+ i,AiPi,t) 



M n (p ni A n - 1 ..A 1 A n p ni kt) > Mi(p 1 ,A n p n ,t) 
Thus, by induction and (4.1m) , we have 
(5.i) A-^iPi = Pi+i for z = 1, 2, • • • , n - 1 

(5.n) A n p n = A n A n -ip n - 1 = ■ ■■ = A n ..A 2 p 2 = A n ..Axpi = pi 

and so, from (4.tim) we have 

An-i--A\A n p n = p n , 

Concerning inequalities (4.1_/v) , (4.i_v) and (A.nj\f). Similarly, we can prove that 

A i -i..AiA n A n -i..A i p i — pi for alH = 1,2, •• • , n. 

For proving the uniqueness of the fixed point pi in Xi we assume that there exists 
Zi G Xi such that z% ^ Pi and Ai_i..AiA n A n _i..AiZi = z% for all % = 1, 2, • • • , n. 
Firstly, using (2.im) for Xj = K and Xj+i = A;Z; we Zio^e, 

/ / i4 i _i..i4iA n i4 n _i..A i+ iA i ai, \ \ 

4 ^ Ai- 1 ..A 1 A n A n _ 1 ..A i p i ,kt )' 

M i (p i ,A i - 1 ..A 1 A n A n -i..A l+1 A l z u t) , 

Mi (p i ,A i _i..A 1 A n A n - 1 ..A i p i ,t),M i+1 (AiZi, AiPi,t) , 

M l+1 (A i z i ,A i ..A 1 A n A n _ 1 ..A i+1 A i Zi,t) ) 

M i+1 (A i pi,A i ..A 1 A n A n -i..Ai +1 AiZi,t) 



V 



>o 



and so, 



M % (zi ,pi, kt) , Mi (pi, Zi ,t), 1, 



/ 



>0 



M i+ i (A^ , AiPi, t) , 1, M l+ i (AiPi, A t Zi , t) 
which implies that, 

Mi (p u Zi , kt) > M i+1 (AiPi, AiZi , t) for alH = 1, 2, • • • , n 

From (5.i) , (5.n) ,we have: 

M 1 (px, zi , kt) > M 2 (AiPi, AiZi ,t) = M 2 (p 2 , z 2 , t) 



Mi (p u Zi , kt) > M i+1 (p i+ i, Zi+i ,t) for alH = 2, • • • , n - 1 



M n (p n ,z n , fci) > Mi (A n p n , A n z n , i) = Mi (p lf ^ , i) . 
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Similarly, we find by (2.ijv) that 

M 1 (p 1 ,z 1 ,kt) < M 1 (p 1 ,z 1 ,t) 

Mi fa, Zi , kt) < Afi+i (pi+i, Zi+i ,t) for alH = 2, • • • , n - 1 



M n (Pn,Z n , kt) < M„ (j>n, Z n , t) 

This proving the uniqueness of pi in X, for i = 1, 2, • • • , n. This complete the 
proof of the theorem. □ 

There is an example satisfies all conditions of Theorem 2.f 

Example 2.2. Let (X i; Mj, A/i, #i, 7j)i<,< n , ^ e n complete intuitionistic fuzzy met- 
ric spaces such that 



Mi (xi,yi,t) 



t 



Mi {xi,yi,t) 



\xi -yi\ 



ill I ' ' \ " on "I j- i I I 

i + |a;»-j/i| * + | ^i — 2/i | 

/or all i = l..n and X\ = [0, 1] , Xi = ]i — f , i[ for all i > 2. Define Ai : Xi — > Xj + i 
/or i = l..n — 1 and j4„ : X„ — > Xi fey 

i 3 i t */ x n <=]" — l,n.— -[ 

A^ = <{ - i/zi e [0, 1] , A„a;„ = <( 4 3 4 

1 if x n £ {n - -,n[ 



An 



% + - ifxi£ 


i 


-l,i- 
3 


3 

4 



for all i = 2..n — 1, 



Let(j) 1 = <t>2 = • • = <i> n = <t> such that (j) (ti,t2,t3,t4,t5,t 6 ) = t\— min{£ 2 , £3, £4, £5,^6/ 
andip 1 = 1^2 = ■■ = V'n — V' smc/i thatip (ti, t 2 ,t 3 ,t4,t 5 ,t 6 ) = £1— max {£2, £3, £4, £5, ie}- 
iVote i/iat i/iere exists uii in Xi such that {Ai-iAi-2--A\A n ...Ai) uii = Wj, Vi = 

l..n. (a,) 7/i — n we get (A n _iA n _2--AiA n ) w n — w n if w„ = n — — because 

(A n _ 1 A n _ 2 -A 1 A n ) ( n- - J = i4 n _iA n _ 2 ..i4 1 (1) 

1 



A n _ 1 A n _ 2 --A i+ i i + 



A n -]A n - 2 ( ra- 2 ) = A»-i ( « - - 



fej Remark that for all i = 1, n — 1 and Xj € 



t-j.t 



AiXi € 



(t + 1)- j,i + l 



t/ien i/iere exists Wi = i — — such that (Ai-iAi-2--A±A n ...Ai) \i — — j = i — — for 
all i — 1,2, ...,n — 1. Further, (Ai_iAi_ 2 --AiA n ...Ai) is continuous in Xi for all 
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i = 2, .., n because if x% = i — — is the point of discontinuity for Ai we have 

lim _ {A i _ 1 A i _2--MA n ..A i+1 Ai) Xi 

1 

4 
3 

4 



4 i _iA i _ 2 ..,4i J 4„..A i+ i I i 
A i _ 1 A i _ 2 ..A 1 A n ..A i+2 (i 



hm (A i - 1 A l -2--AiA n ..A l+1 Ai)x i 

= A,_ 1 i t _ 2 ..Ai4..Ai + i (i + - 

= i4 < _ii4 i _ 2 ..i4iA n ..i4 i+ 2 U + - 

If we take n = 2 in Theorem 2.1, we obtain a generalisation of Theorem 2.1 of 

[3]- 

If we put n = 4 in the Theorem 2.1 we get the following corollary which gener- 
alizes the Theorem 2.1 of [15]. 

Corollary 2.3. Let (Xi,Mi,Afi,6i,y i ) 1<i<i , be four complete intuitionistic fuzzy 
metric spaces with Mi(x,Xi,t) — ► 1 and A/i (x,Xi,t) — > as t — ► oo for all 
x, Xi G Xi and let {A i } i ~ 1 be 4-mappings such that A t : Xi — ► X i+ i for all i = 1, 2, 3 
and A4 : X4 — > Xi , satisfying the inequalities, 

I M i (A i _ 1 A i _2-A 1 A 4 ..A i+1 Xi +1 ,A i _ 1 A i _2-A 1 A 4 ..A i x i ,kt), \ 

Mi (xi, A i _iA i _2-AiA i ..A i+ iXi + i,t) , 

(6.i M ) & M i (a;i, J 4i_iA i _2..A 1J 4 4 ..A i a;j,i),M i+1 (a;i + i,^a;i,t), >0 

M i+ i (xi + i,AiAi-i..AiA4..Ai +1 Xi+i,t) , 

\ M l+1 (A,a; 4 ,A l A l _i..^i J 44..A t+ ia; 4+ i,t) / 



(6-ijv) ^ 



A/i (xj, A i -iAi-2--AiA i ..Ai + ix i+1 ,t) , 

Ni (xu A i -iAi-2..AiA 4: ..AiXi,t) ,A/i+i (a;i + i,^4,a;i,t) , 

A/i+i (xi + i,A i Ai_i.. J 4iAi..Ai + iXi + i,t) , 

A/i+i (A^i, A i A i ^ 1 ..A 1 A 4: ..Ai + iXi +1 ,t) 



<0 



/or a// x, € .X",, x,+i <G -Xi+i, t > 0, where, 4>i <G <&, ip i <E ^ , i = 1,2,3,4 anc 
< k < 1. Further, suppose that for all i — 1,4. {A 2 } is continuous on Xi Then 

A4A3A2A1 has a unique fixed point p\ G ATj 

A1A4A3A2 has a unique fixed point P2 € AT 2 

A2A1A4A3 has a unique fixed point P3 G AT3 

A3A2A1A4 has a unique fixed point p 4 G X4. 

Further, AiPi — Pi+\ for i = 1,2, 3,4. 
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Ki<4 



Proof. Let 



{,<•>} 



be sequences in {Xi} x 



<i<4 '■ 



point in Xi, we define the sequences < Xr f for % = 1, 2, 3, 4 by 

*« = (^S^ifa^, 



Now let Xq be an arbitrary 



,(3) 

3 (4) 






Ai.t 



(4) 



4 x t — 1 



Similarly of the proof in the case of three spaces, we can prove that < Xr > is 
Cauchy sequences in X t for all i — 1,2,3,4 with a limit 

pi = lim x] 1 - 1 , 

r — >oo 

P2 — lim x), ' = lim A\x\. ', 

r — >oo r — >oo 

P3 = lim xf' = lim A 2 x\. ' = lim A 2 Aix\, ', 



Pi = lim x ( r ] — lim As^Aix^ 1 ' = lim A 3 A 2 xf ) = lim A 3 x 



(•'!) 



>0 



To prove that Pi is a fixed point of Ai_\..A\Ai..AiPi for i = 1, 2, 3, 4 suppose that 
A i ^ 1 ..A 1 A 4 ..A l p i ^ pi. Using the inequality (6i M ) for x l = p l; Xj+i = x^\' = 
AiX*_i — Ai..Aix\.\ and t = l,2,3we obtain 

M i (A i _ 1 ..A 1 xl l \A i -i~AiA 4 ..A iPi ,kt), 
M i (p i ,A i _ 1 ..A 1 x^ ) ,ij, 

M % ( Pl ,A i - 1 ..A 1 A 4 ..A iPl ,t),M l+1 ^..Axx^AiPi,^ 

Mi+i (^..AiasWi.^i-^iaJr ,* 

M l+1 (A 4 p J ,A,..A 1 ^ ) ,t) 

' M ^ i) ,i4 i _i..Aii4 4 ..i4 i p i ,fct) ,M (pi,a£\t) . \ 

M i (p i ,i4 i _i..i4iAii4 3 ..i4 i p i ,t) ) 

M 4+1 (a^.^Pi.t) ,M i+1 (x^.^+^.t) , 

M 2+1 (A lPl ,^ +1) ,t) 



and so, 



(7.i M ) 



for all i = 1, 2, 3 and 0J e $. 



>0 



,« 



In the case where i = 4, we pose that 24 = 754 and x\ — x r , then we have, 
/ M 4 (xf 4) , A 3 A 2 A 1 A 4 p i , kt) ,M 4 (j^.a^.t) , N 



(7.4 M ) 



V 



M A (p 4 , A 3J 4 2J 4iA 4 p4, t) , Mi ^ 1} , A 4 p 4 , t 
Afi (x, (1) , jW , t) , Mj [A APi , 4+i , t) j 



>0 
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Letting r — > oo in (7Am) and (7.4m) we have, 

M % (p h Ai_ 1 ..A 1 A 4 ..A i p i , kt) , 1, 
M i (p i ,A i _ 1 ..A 1 A 4 A 3 ..A iPi ,t), | M) 

M m (p i+ i,AiPi,t) , l,M i+1 (AiPi,p l+1 ,t) 

and 

M 4 (p 4) ^3^2AiA 4 p 4 , t) , Mi (pi, A 4 p 4 , t) , I ^ 
l,Mi(i4P4,Pl,t) 

It follows from (iii) that 

(8.i M ) M % {p ii A i _ 1 ..A l A A ..A i p i ,kt) > M l+1 { Pi+ll A lPll t) 

for all i — 1, 2, 3, 4 which mean again, 

M l {p l ,A 4 A 3 A 2 A l p l ,kt) > M 2 (p 2 ,A lPl ,t) 
M 2 (p 2 ,A 1 A 4 A 3 A 2P2 ,kt) > M 3 (p 3 ,A 2 p 2 ,t) 
M 3 ( P3 , A 2 A!A 4 A 3P3 , kt) > M 4 (p 4 , A 3P3 , t) 
M 4 (p 4 , A 3 ^ 2 ^i A 4 p 4 , fct) > Mi (pi, A 4 p 4 , t) 
Suppose that A 2 is continuous. Then, from (8.2m) we get, 

A 2 p 2 = p 3 
A 1 A 4 A 3 A 2 p 2 = p 2 
AxA 4 A 3 p 3 = p 2 
A 2 AiA 4 A 3 p 3 = A 2 p 2 = Pa- 



Using the inequality (6.«m) for Xi — Xr = A;_i..^4ia;r ± '' and a;j+i 



/ 



(9.i M ) 4 






Mi ^A i _i..i4iAi..i4 i+ ip i+ i,a;^ 1 ,fct 
M, (x? ) ,A i _ 1 ..A 1 A 4 ..A i+1 p i+1 ,t) ,M t (^,4+1.*) . 
M i+ i [p i+ i,x ( r ,t] ,M i+1 (pi +1 ,A i ..A 1 A 4 ..A i+1 p i+1 ,t) , 
A i ..AiAi.. J 4i + iPi+i,t 



(8.1m) 
(8.2 m ) 
(8.3m) 
(8.4 m ) 

(9.1) 
(9.2) 
(9.3) 
(9.4) 

Pi+i we have 

\ 



>0 



„(i+l) 



a:£_2i,fct) ,M 3 (4 3) , A 2 A 1 A l p 4 ,t > ) , \ 



M l+1 (»<! 

M 3 f A 2 A 1 A 4 p 4 , A, r+1 , 

M 3 (xl 3) , a^i, *) , M 4 [p 4 , xi 4) , t) 

^ M 4 (p 4) ^3^2-4lA 4 p4, t) , M 4 (xl 4) , i4 3 i4 2 i4iA t P4 ) t) y 



/ 



>0 



M 4 (^3^2 Aipi, 4+i, A* 

M 4 (4 4) , A 3 ^ 2 ^iPi , t) , M 4 (xl 4) , x { r % 7 1) 

Mi fpi, a;^ i) , M x (p!, A 4 A 3 A 2 A lPl , t) . 



V 



Mi (xQvAiAsAzAw^t) 

Letting r — > oo in (9.«m) we have 

/ M i (A i _ 1 ..A 1 A 4 ..A l+1 p i+1 ,p i ,kt) , \ 
M i (p i ,A i _ 1 ..A 1 A 4 ..A i+1 p i+1 ,t), 1,1, 
91 M i+1 (p i+1 ,A i ..A 1 A 4 ..A i+1 p i+1 ,t) , 
\ M i+1 (p i+1 ,A l ..A 1 A 4 ..A i+1 p i+ i,t) J 



>0 



>0 
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which it means, 



Mi (i44i43i42P2.pi, kt) ,Mi (pi,i4 4 i4 3 i42P2,«) , 1, 1, 
M 2 (p 2 , i4 1 i4 4 i4 3 i42P2, t) , M 2 (p 2 , i4ii4 4 i4 3 i42P2, t) 



>0 



>0 



>0 



>0 



(10.1m) 
(10.2m) 
(10.3m) 
(10.4m) 



M 2 (i4 1 i4 4 i4 3 p3,p2, kt) , M 2 {p 2 ,A 1 A 4 A 3 p 3 , t) , 1, 1, 
M 3 (p 3 , i4 2 i4 1 i4 4 i4 3 p 3 , i) , M 3 (p 3 , i4 2 i4 1 i4 4 i4 3 p 3 , t) 

M 3 (i4 2 i4 1 i4 4 p 4 ,p 3 , fct) , M 3 (p 3 , A 2 ^i ^4P4, t) , 1, 1, 
M 4 (p 4 , i4 3 i4 2 i4ii4 4 p 4 , t) , M 4 (p 4 , i4 3 i4 2 i4ii4 4 p 4 , t) 

M 4 (i4 3 i4 2 i4 1 p 1 ,p 4 , kt) , Mi (p 4 , i4 3 i4 2 i4 1 p 1 , t) , 1, 1, 
Mi(pi,i4 4 i4 3 i4 2 i4ip 1 ,t), 
M 1 (pi,i4 4 i4 3 i4 2 i4 1 pi,t) 

It follows from (iii) that: 

M 4 (i4 4 i4 3 p 3 ,pi, fct) > M 2 (pa, i4ii4 4 i4 3 p 3 , t) , 
M 2 (i4ii4 4 i4 3 p 3 ,p2,fei) > M 3 (p 3 ,i4 2 i4ii4 4 i4 3 p 3 ,t) 
M 3 (i4 2 i4 1 i4 4 p 4 ,p 3 , kt) > M 4 (p 4 , i4 3 i4 2 i4 1 i4 4 p 4 , i) 
M 4 (i4 3 i42i4 1 p 1 ,p4,fci) > M 1 (p 1 ,i44i4 3 i4 2 i4 1 p 1 ,i) 

Then, from (9.1) , (9.2) and (10. 1 M ) we have 

(11.1) i4 4 i4 3 p 3 =p 1 . 
From (9.4) , (10. 2 M ) and (11.1) we get, 

(11.2) i4iP!=p 2 

From (10. 4 M ) , (11.1) and (11.2) we obtain, 

-4-3P3 = Pi 
i4 4 A 3 p 3 = i4 4 p 4 = pi 

Now, substitute (11.2) and (11.4) in (7.4m) and (8.4m) respectively, we get 

i4 4 i4 3 i4 2 i4ipi = pi 

A 3 A 2 A 1 A 4 p 4 = p 4 

Concerning inequalities (6.ijv") , we can prove by the same manner that 

Ai-\..A\Ai..Aipi — Pi for all i = 1,2,3,4. 

For proving the uniqueness of the fixed point pi in Xi we assume that there exists 
Zi G Xi such that z% ^ Pi and A{-\..A 4 A 3 ..AiZi — z% for all i — 1,2,3,4. Firstly, 
using (6.«m) f or x i — Pi and Xi + \ = A; t Zi we have, 

( M l (A l - 1 ..A 1 A 4 ..A l+1 A l z l ,A l - 1 ..A 1 A 4 ..A l Pi,kt), \ 

Mi (p i ,A i - 1 ..A 1 A i ..Ai +1 A i Zi, t) , 

M i (p i ,A i _ 1 ..A 1 A i ..A i p h t),Mi +1 {A i Zi,A iPi ,t), 

M i+1 (A i z i ,A i ..A 1 A 4 ..A i+1 A i z i ,t), 

Mi +1 (A i p i ,A i ..A 1 A A ..Ai +1 A i Zi,t) 

and so, 

M % (zi ,pi, kt) , Mi fa, Zi ,t), 1, 
M i+ i (AiZi , AiPi, t) , 1, M l+1 (AiPi, A t Zi , t) 
which implies that, 



(11.3) 
(11.4) 



V 



>0 



/ 



>0 



Mi ( Pi , Zi , kt) > M l+1 (A iPi , AiZi , t) for all i = 1, 2, 3, 4 
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From (9.1) , (11.2) , (11.3) and (11.4) we have: 

Mi(pi,zi,fct) > M 2 (A lPu A 1 z 1 ,t) = M 2 (p 2 ,z 2 ,t) 

M 2 (pa ,z 2 ,kt) > M 3 (A 2 p 2 ,A 2 z 2 ,t) = M 3 (p3,z 3 , t) 

M 3 (pa , z 3 , fct) > M 4 ( A 3 p 3 , A 3 z 3 , i) = M 4 (p 4 ,z 4 ,t) 

M 4 (p 4 , z 4 , kt) > Mi (A 4 p 4 , A 4 z 4 ,i) = M 1 (pi,z 4 , t) 

which implies again that, 



M 1 ( Pl ,zi 


,kt) 


> 


M 1 (p 1 


Z\ 


.*) 


M 2 (p 2 ,z 2 


,kt) 


> 


M 2 {p 2 


Z<1 


>*) 


M 3 (p 3 ,z 3 


,kt) 


> 


M 3 (p 3 


Z3 


>*) 


M 4 (p 4 ,z 4 


,kt) 


> 


M 4 (p 4 


z 4 


,*). 



Similarly, we find by (6.zjv) that 



M (pi,Z! ,kt) 
M 2 {p 2 ,z 2 ,kt) 
A/" 3 (p 3 ,z 3 ,kt) 
J\f 4 (p 4 ,z 4 , kt) 



< Af 4 (pi,zi ,t) 

< M 2 {p 2 ,z 2 ,t) 

< AA 3 (p 3 ,z 3 ,t). 

< N 4 (p 4 ,z 4 ,t) 



This proving the uniqueness of Pi in Xj for i = 1,2,3,4. This complete the proof 
of the theorem. □ 



The following example illustrates our Corollary 2.3. 

Example 2.4. Let X 4 = [0,1] ,X 2 = [1,2[,X 3 =]2,3],X 4 =]3, 4 ] andlet M l (x i ,x i+1 ,t) 

: r for all i = 1,3 and M5 (X5, cci, i) = ; r. Define A\ : X\ — > 

i+|a; i+ i-Xj| * + |a?5 — 

X 2) A 2 : X 2 ->■ X 3 , A 3 : X 3 ->• X 4 and A 4 : X 4 ^ Xi by 



A1X1 



A 3 x 3 



1 if xi G [0, -[ f 5 

1 i/a^e [|,i] L 2 

T^s€]2,|[ 

7 5 

2 i/^f [-,3] 



A 4 x 4 



3 •/ 

- z/ x 4 e 


N 


1 i/.x4 e 


.H 



lef <j) 1 (t 1 ,t 2 ,t 3 ,t 4 ,t 5 ,t 6 ) = t x - mm{t 2 ,t 3 ,t 4 ,t 5 ,t 6 } and <$> x = 4> 2 = cf> 3 = 4 . 
_ffere -X"i is compact, but the others spaces are not compact. Further the inequalities 
(5.1) , (5.2) , (5.3) , (5.4) , (5.5) are satisfied since the left hand side of each inequality 

3 
is 1 and A 4 A 3 A 2 A\ is continuous in X\ because if x — - , the point of discontinuity 
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for A\ , we get 

lim_ A 4 A 3 A 2 A lX = A A A 3 A 2 (1) = A 4 A 3 (3) = A 4 (-j = A 5 (^ 

and 
lim + A i A 3 A 2 A 1 x = A A A 3 A 2 ( - J = A±A 3 (3) 



PFe ho 



A i A 3 A 2 A 1 {l) = 1, A^A^A^A-i. 



3\ _ 3 

2/ ~ 2' 
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A MODIFIED PARTIAL QUADRATIC INTERPOLATION 
METHOD FOR UNCONSTRAINED OPTIMIZATION 

T.M. EL-GINDY, M.S. SALIM, AND ABDEL-RAHMAN IBRAHIM 



Abstract. A numerical method for solving unconstrained optimization prob- 
lems is presented. It is a modification of the partial quadratic interpolation 
method [11] for unconstrained optimization and based upon approximating 
the gradient and the Hessian of the objective function. This means that it 
requires only the expression of the objective function to converges to a sta- 
tionary point of the problem from any initial point, with speed convergence. 
The method can solve complex problems in which direct calculations of the 
gradient and Hessian matrix are difficult or even impossible to calculate. The 
search directions are always descent directions. Results and comparisons are 
given at the end of the paper and show that this method is interesting. 



1. Introduction 

In this paper, we consider the unconstrained optimization problem 

(1.1) min/(a;), i£l„ 

where / : R„ — > R is a continuously diffcrcntiable function, and its gradient at 
a point x r , r € N, is denoted by g(x r ), or g r for simplicity, n is the number of 
variables. One of the most effective methods for solving the unconstrained problem 

(1.1) is the Newton method. It normally requires the fewest number of function 
evaluations, and is very good at handling ill-conditioning. However, its efficiency 
largely depends on the possibility of solving efficiently a linear system which arises 
when computing the search direction d r at each iteration, 

(1.2) H(x r )d r = -g{x r ) 

Where H(x r ) is the matrix of second partial derivatives (Hessian matrix) of / and 
d r is a search direction in the current iteration. Moreover, the exact solution of 
the linear system (1.2) could be too burdensome, or is not necessary when x r is far 
from the solution of / (see [25, 27]). 

It is emphasized [3] here that unless [H{x r )\ is positive definite, the direction 
— [H(x r )]~ l [g(x r )] will not be that of descent for the objective function, to see this 
we substitute the direction into the descent condition to obtain 

(1.3) -[g{x r )] T [H(x r )]- l [g{x^)]<Q 

The foregoing condition will always be satisfied if [H(x r )] is positive definite. If 
[H(x r )\ is negative definite or negative semidefinitc the condition is always violated. 
With [H(x r )\ as indefinite or positive semidefinite, the condition may or may not 



Key words and phrases. Unconstrained optimization, descent direction, partial quadratic in- 
terpolation method. 
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be satisfied, so we must check for it. If the direction is not descent for the objective 
function, we should stop there because a positive step size cannot be determined. 

There exist some kinds of effective methods available for solving (1.1), as for 
instance, inexact Newton, limited memory quasi-Newton, truncated Newton (TN), 
conjugate gradient, spectral gradient, and subspace methods. Inexact Newton 
methods (see [10, 25]) represent the basic approach underlying most of the Newton- 
type large-scale unconstrained algorithms. At each step, the current estimate of the 
solution is updated by approximately solving the linear system (1.2) using an iter- 
ative algorithm. The inner iteration is typically " truncated " before the solution 
of the linear system is obtained. 

In Quasi-Newton methods the Hessian matrix of second derivatives of the func- 
tion to be minimized does not need to be computed. The Hessian is updated by 
analyzing successive gradient vectors instead. Quasi-Newton methods are a gener- 
alization of the secant method to find the root of the first derivative for multidimen- 
sional problems. In multi-dimensions the secant equation is under-determined, and 
quasi-Newton methods differ in how they obtain the solution, typically by adding 
a simple low-rank update to the current estimate of the Hessian. 

The limited memory BFGS (L-BFGS) method (see [20, 26, 33]) is a member 
of the broad family of quasi-Newton optimization methods. It use a low and pre- 
dictable amount of storage, and only require the function and gradient values at 
each iteration and no other information about the problem. They are suitable for 
large scale problems because the amount of storage required by the algorithms ( 
and thus the cost of the iteration ) can be controlled by the user. Alternatively, 
limited memory methods can be viewed as implementations of quasi-Newton meth- 
ods, in which storage is restricted. Their simplicity is one of their main appeals: 
they do not require knowledge of the sparsity structure of the Hessian, or knowl- 
edge of the separability of the objective function, and they can be very simple 
to program. Limited memory methods originated with the work of Perry (1977) 
and Shanno (1978), and were subsequently developed and analyzed by Buckley 
(1978), Nazareth (1979), Nocedal (1980), Gill and Murray (1979) and Buckley and 
LeNir (1983). Numerical tests performed on medium size problems have shown 
that limited memory methods require substantially fewer function evaluations than 
the conjugate gradient method, even when little additional storage is added. The 
implementation is almost identical to that of the standard BFGS method, the only 
difference is that the inverse Hessian approximation is not formed explicitly, but 
defined by a small number of BFGS updates. It often provides a fast rate of linear 
convergence, and requires minimal storage. 

If only gradients are available and memory is limited, one may use a discrete 
truncated Newton method (DTN) such as in [10, 26]. DTN uses gradients to 
approximate the product of the Hessian with an arbitrary vector. Several attempts 
have been made to create a method which combines the properties of the (discrete) 
truncated Newton method and the L-BFGS method. 

Conjugate gradient methods have played special roles in solving large scale non- 
linear optimization problems. Although conjugate gradient methods are not the 
fastest or most robust optimization algorithms for nonlinear problems available to- 
day, they remain very popular for engineers and mathematicians with solving large 
problems [13, 17, 22, 36, 37]. The convergence properties of conjugate gradient 
methods have been studied by many researchers [1, 8, 9, 15, 16, 28, 31]. Good 
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reviews of the conjugate gradient methods can be found in [13, 18]. Although all 
these methods are equivalent in the linear case, that is, when / is a strictly con- 
vex quadratic function and a r is computed by an exact line search, their behavior 
for general functions may be quite different. The search direction in all nonlinear 
conjugate gradient methods is given by d r = —g r + [3 r d r ~ 1 ,with d° = —g° and ff 
being a scalar. Most of the recent work on nonlinear conjugate gradient methods 
is focused on the design of a new f3 r or a new line search strategy. The large- 
scale unconstrained optimization problems have received much attention in recent 
decades. We refer to [2, 18] for a good survey. 

Spectral gradient methods have proved to be of great value in unconstrained 
optimization problems. They were introduced by Barzilai and Borwein [4], and 
analyzed by Raydan [29] . They have been applied to find local minimizers of large 
scale problems [5, 6, 30], and also to explore faces of large dimensions in box- 
constrained optimization see [12]. More recently, spectral gradient methods were 
extended to minimize general smooth functions on convex sets See [7] . In this case, 
the spectral choice of step length was combined with the projected gradient method 
[14, 19, 34] to obtain a robust and effective low cost computational scheme. 

Subspace techniques [32, 35] used in constructing of numerical methods for non- 
linear optimization. The subspace techniques are getting more and more important 
as the optimization problems we have to solve arc getting larger and larger in scale. 
The applications of subspace techniques have the advantage of reducing both com- 
putation cost and memory size. Actually in many standard optimization methods 
(such as conjugate gradient method, limited memory quasi- Newton method, pro- 
jected gradient method, and null space method) there are ideas or techniques that 
can be viewed as subspace techniques. The essential part of a subspace method 
is how to choose the subspace in which the trial step or the trust region should 
belong. 

The iterative formula of these methods is given by 

(1.4) x r+l =x r + a r d r 

where a r > is a step length. Generally [3], we can say that d r is a descent 
direction of / at x r if 

(1.5) g r -d r <0 

The classical Newton's method is the basis of many numerical methods for un- 
constrained optimization problems. If, in the case of minimization, the Hessian 
matrix of the function under consideration is positive definite, then a numerically 
stable method for forming the descent direction is to factorize the Hessian matrix by 
the method of cholcsky. In cases where the matrix is singular or, more commonly, 
indefinite, Newton's algorithm is no longer a steepest descent algorithm. In addi- 
tion the method of cholesky is no longer numerically stable, even if the factorization 
of Hessian exists, there have been several algorithms [23] which replace the Hessian 
matrix in Newton's method by a related positive definite matrix. The disadvan- 
tages of such methods is that either they cease to determine the descent direction 
in a numerically stable way, and/or the amount of work necessary to evaluate this 
direction is considerably greater than that required by cholesky's method. 

The partial quadratic interpolation method [11] or shortly (P.Q.I.) technique 
may be considered as a second-order method, the principal idea of this method is 
to approximate the objective function f(x) about certain point x r by second degree 
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polynomial in the space R„, from which one may determined approximations to the 
gradient and Hessian of this function as [6„(x r )] and [j4 n (i r )] respectively. One then 
extract a positive definite matrixL4p(a; r )] from the Hessian matrix [A„(x r )] in the 
subspace K p C R„, using a particular application of the cholesky technique. 

PQI technique. 

Step 1. Choose some guessed starting point x r G M. n . 

Step 2. Approximate the function f(x) about x r in a quadratic form: 

f(x) ~ P (x) = a+ [b n {x r )] T [x - x r ] + ±{x - x r ] T [A n (x r )][x - x r }. 

the constant a; and the matrices [6„(x r )] and [A„(a; r )] are computed using certain 
interpolation points [11]. 

Step 3. Use the necessary condition for the minimum of p(x) we have 

x r+i = x r - [^oor'MO] 

or 

[A n (x r )][A] = -[b n (x r )] 
where 

[A] = x r+1 - x r 

Step 4- Extract the symmetric positive definite matrix [A p ] from the symmet- 
ric matrix [A n ] using Choleski's method, where: 

[A P ] = [L P ][L P ] T 

Step 5. Solve the system of the equations: 

[L p ][L p ] T [Ai] = [b p ], A t eM p 

which is equivalent to the solution of the two systems of equations 

[L P }[Y] = [b p ] 

[L P ] T [K] = [Y]. 
Step 6. The new point, or the next guess is then given by: 

x[ +1 = x\ + aAj for Xi ER p , < a < 1 

and 

x\ +1 = x\ for Xi fi R p 

Step 7. To determine the new point x r+l G R„, we try a — 1, ^, j, ... and 
take the first value of a which satisfies f(x r+1 ) < ,f{x r ), x r e M. n . If a becomes 
sufficiently small without satisfying this condition, we use new interpolation points, 
and restart the calculation, 

The method has many advantages: 

• It avoids direct calculations of the gradient and Hessian matrix, which are 
difficult or even impossible to calculate in certain problems. 

• It avoids evaluation of the inverse matrix needed at each iteration. 

• It ensures convergence to a local minimum, regardless of the choice of initial 
value. 
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But in some problems it has two disadvantages: 



• The fixing of the variables Xj G K n _ p may cause the convergence to be 
slow. 

• The operation of canceling certain rows and columns may at certain point 
involves all the rows and columns of the Hessian matrix A n , and then, the 
procedure stop before achieving the solution. 

To overcome these disadvantages we suggest a modification to the (P.Q.I.) tech- 
nique. The modification is to complete the direction obtained from the (P.Q.I.) 
technique from a certain vector, which is a descent vector, and this produce a 
faster descent direction converges to the minimum value of the objective function 

/• 

This paper is organized as follows. In Section 2, we describe the modified par- 
tial quadratic interpolation technique . Numerical results and one conclusion are 
presented in Section 3 and in Section 4, respectively. Throughout the paper, || • || 
denotes the Euclidean norm of vectors. 



2. Modified P.Q.I, method 

In this section, we propose a new algorithm to solve (1.1). This algorithm gen- 
erates a sequence of points x r by 



a r d r 



0,1,2, 



where d r is a descent direction of / at x r , and a r is the step length which is 
determined by a line search. In the following, we describe the method in details. 

Let f(x),x G K n be a continuous function, [g n (ai r )] is a column matrix repre- 
senting the gradient vector of / at x r , and [H n (x r )} is the matrix of second partial 
derivatives (Hessian matrix) of / evaluated at the point x r . Let [H p (x r )] is the 
positive definite matrix extracted from \H n (x r )] and [<7 p (x r )] is the corresponding 
gradient vector of f(x) in the sub-space M p C R„. Define \g n - p (x r )] as the gradient 
of f{x) with respect to the variables Xj ^ R p or Xj G (R„ — R p ), where, 
[x p ] is the column vector of the components x^ £R p . 
[a^n-p] is the column vector of the components Xj ^ R p or Xj G (M„ — K p ). 

The P.Q.I, method have a search direction in the form — [H p (x r )]~ 1 [g p (x r )], and 
we propose the search direction to be, 

[iW)]-% P (z r )r 

-c[g n - p {x r )\ 



where, 



ll[ffn(^)]|| 2 /ll[^(^)]bn(^)]|| 2 , M* r )] = M* r )] - M* p_1 )] 



if r = 0, 
if r = l,2, 



This mean that in first iteration c = 1 and in the subsequent iterations c 

\\[ 9n (x r )}\\ 2 /\\[Hn(xn][qn(x r )}\\ 2 , 

[q n (x r )} = [g n (x r )] — [g n (x r 1 )}. We note that d r is a descent direction since, 
satisfies (1.5) i.e. 



[g n - P (x r )] 



-[H p (x r )]- l [g p {x r )\ 
-c[g n _ p (x r )] 



<0 
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Theorem 2.1. Let f : R„ — > R, x' r G IR„ and x r is not a critical point and, If 
the Hessian matrix [H p {x r )\ is a positive definite matrix. Then the direction 

r-[# P (* r )]- 1 [s P (* r )]i 



d n (x r ) 



-c[g n - P (3 



where, 

1 i/r = 0, 

JIM* r )]llVll[ff»(* P )][«»0O]f> [g„(^)] = M*")] - [»n(^- 1 )] tfr = l,2,... 

is a descent direction. 

Proof. Approximating f(x r + td n (x r ) by using Taylor's series expansion about the 
point x r , we get 

f(x r +td n (x r )) = f(x r ) + t[g n (x r )] T [d n (x r )] + 0(t 2 ) 



f(x r )+t 



[g n -p(x r )\ 



-c[g n - p (x r )] 



+ 0(f 



(2.1) = f{x r )-t{[g p {x r )] T [H p {x r )]-\g p {x r )]+c[g^ v (x r )] T [g n ^{x r )]} + 0{t 2 

If t > is small, then the error term 0(t 2 ) will be neglected, 
therefore, eq. (2.1) take the form 

(2.2) f(x r +td n (x r ))-f(x r ) = -tilgAxlflHAxir'igri^Mgn-Axlflgn-rixl}} 

but [H p (x r )] is a positive definite matrix, then 

-[g p (x r )] T [H p (x r )}- 1 [g p (x r )}<0 

also c[g n - p (x r )] T [g n - p (x r )}, and £ are always positive. Then the right-hand side of 
equation (2.2) is negative, so that: 

f(x r + td n (x r ))<f(x r ) 

Then d n {x r ) is a descent direction. □ 

Let [6„(a; r )] is a column matrix (bi) representing the approximate value of the 
gradient [g n (x r )\, and L4 T j(x r )] is a square matrix (<%•) of order n representing 
the approximate value of the Hessian \H n (x r )]. If we use the approximate values 
[A„(x r )] and \b n (x r )] of the Hessian and gradient, we get the same result. 
Using these approximate values, we can write 

(2.3) f(x r + td n (x r )) = f(x r ) + t[b n (x r )} T [d n (x r )} + v(t, [£]) 

where r/(t, [£]) is the error term, and [I] — {£\,£i, ■■■An)- 
In the same way equation (2.1), becomes 

f(x r + td n (x r )) = f(x r ) - t{[6p(a; r )] r [^ ) (a: r )]- 1 [6p(a; r )] 

(2.4) + c[b n _ p {x r )] T [b n ^ p {x r )]} + r,{t,[f]) 

As before the error term will be neglected when t and [£] are small, and hence 

(2.5) f(x r +td n (x r ))<f(x r ). 

Let f{x) be defined over the space R„, then the approximation L4 rl (:E r )] of the 
Hessian is not, in general strictly positive definite. Now we extract a positive 
definite matrix [A p (x r )] from the Hessian matrix [A„(a; r )], and hence we construct 

-[A.OOr'M*'")] 



the vector 



-c[b n - p (x r )} 



which is from our previous theorem a descent 
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direction. This direction generates a sequence of points x l ^x 2 1 ...,x r , ... converges 
to the optimum solution from any initial point x . 

Now, we state the steps of the modified partial quadratic interpolation (modified 
P.Q.I.) algorithm as follows. 

Algorithm 2.2. 

We assume that f(x) is a given continuous, scalar valued function, where x € K n 
and we seek to determined the point x* such that: 

/(#*) < f(x) for all x near to x* 

Step 1. Choose an initial point x° G R„. Select a convergence parameter e > o 
and set r = 0. 

Step 2. Approximate the function f(x) about x r in a quadratic form: 

f(x) ~ a + [b n (x r )] T [x - x r ] + ±[x - x r ] T [A n (x r )][x - x r }. 
Step 3. Calculate the matrices [b n (x r )] and [yl„(x r )] by the following relations, 

/«+)-/«_) 
bj = 

_ f(xl) + f(x r l _)-2f(x r ) 

a ii ff2 



and, 
Where, 





fK) 


-f(x r i+ )-f(x r j+ )+f(xn 


a t j 




t-itj 


+ 4 




Q>ii r /^ Q-rj" r J 



f" 



r / r r r \ 

X \%li %2> "•' *n/' 

Xj + V-^l j a; 2! •"' ^i — 1j •£» ' *•») ^i+li •") "^n/' * t, Z, ..., n, 

r ( r r r v g r r \ ■ 10 

x i_ — v^li ^2 7 ■■■! *»— li ■*'» **J ^i+li ■"J •^nJi * J-) -^) •••> "> 

• T ij = (2-1 j 2-2 J ••• J "E»— 1; -^i "N-ij 2-i+lj •■•) Xj+tj, Xj+i, ..., £ n J, i = 1, Z, ..., 71— 1, J — i + 1, ..., fl. 

S'iep ^. If ||[&n(^ r )]|| < £) then stop as x* — x r is a minimum point. Otherwise 
go to the next step. 

Step 5. Extract the symmetric positive definite matrix [A p ] from the symmet- 
ric matrix [A n ] using Choleski's method, where: 

[Ap\ = [Lp\ [Lp\ 

and 

[A] = -L4 P ]"%] 
which is equivalent to: 

[A p }[A] = -[b p ]. 
Step 6. Solve the system of the equations: 

[L p ][L p ] T [A] = [b p ], AeR p 
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which is equivalent to the solution of the two systems of equations 

[L p ][Y] = [b p ] 

[L P } T [A} = {Y]. 
Step 7. Determine the search direction at the current point x r by 

[A] 

-49n-p(x r )} 



if r = 0, 

[b n (x r )]\\ 2 /\\[A n (x r )][q n (x r )]\\ 2 , [q n (x r )] = [b n (x r )] - M* r_1 )] if r = 1,2,. 

Step 8. Find the optimal step length a r in the direction d r . A one dimensional 
search is used to determined a r . 

Step 9. Set x r+1 = x r + a r d r . Set r = r + 1 and go to Step 3. 



3. Numerical Results 

The main aim of this section is to report the performance of Algorithm 2.2 on a 
set of test problems. The codes were written in Fortran77 and in double precision 
arithmetic. All the tests were performed on a PC by using the exact line search. 
Our experiments arc performed on a set of 26 unconstrained problems. We test our 
method with the result given in [21]. 

For [21] we give the numerical experiments of MP.Q.I against the original BFGS 
formula with Amijo line search and MBFGS proposed by Liying Liu, Zengxin Wei 
and Xiaoping Wu. The problems that we tested are from [24]. We will stop the 
program if the inequality 1 1 t»(cc ,, ~) | ] < 1CT 4 is satisfied. The computation results are 
given in Table 1. 

where the columns have the following meanings: 

Problem: the name of the test problem in [24] ; 

Dim: the dimension of the test problem; 

NF: the total number of the function evaluations; 

NI: the total number of iterations. 

From Table 1, we can see that the modified P.Q.I, method in the paper works 
well. On one hand we see that the total number of iterations in MP.Q.I. method 
is less than that of both methods for almost all the problems. We emphasis here 
that the optimum function values which obtained from our method is a very good 
approximation to the exact solution. On the other hand, for all problems MP.Q.I. 
reaches the solution point but BFGS fails in four problems and the MBFGS fails in 
six problems. Finally for the total number of the function evaluations MP.Q.I. for 
a number of problems was less than that of BFGS or MBFGS methods. Therefor 
the modified P.Q.I, is superior to BFGS and MBFGS. 

In summary, the presented numerical results reveal that Algorithm 2.2, compared 
with BFGS, MBFGS methods, has many advantages for these problems. 
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Table 1. Test results for Algorithm 2.2/MBFGS/BFGS. 



Problems 


Dim 


MP.Q.I 


MBFGS 


BFGS 






NI/NF 


NI/NF 


NI/NF 


Rosenbrock 


2 


14/138 1000/1998 


33/53 


Freudenstein and Roth 


2 


5/59 


11/15 


9/21 


Brown badly scaled 


2 


5/108 


1000/2001 


Fail 


Jenrich and Sampson 


2 


6/73 


Fail 


11/23 


Helical valley 


3 


7/80 


38/43 


26/53 


Bard 


3 


6/56 


36/57 


30/93 


Gaussian 


3 


1/9 


3/5 


2/5 


Gulf research and development 


3 


15/167 


1/2 


1/2 


Powell singular 


4 


8/82 


29/34 


21/44 


Kowalik and Osborne 


4 


5/53 


27/29 


27/30 


Brown and Dennis 


4 


8/148 


Fail 


Fail 


Biggs EXP6 


6 


9/94 


33/36 


1000/3276 


Osborne 2 


11 


12/117 


Fail 


55/81 


Watson 


20 


22/274 


46/52 


41/77 


Extended Powell singular 


4 


8/82 


29/34 


21/44 


Penalty I 


2 


2/28 


10/12 


6/11 


Penalty II 


4 


2/31 


1000/1999 


13/23 




50 


25/238 


Fail 


193/913 


Variably Dimensioned 


2 


2/26 


9/11 


5/13 




50 


42/393 


60/66 


31/116 


Trigonometric 


3 


5/60 


26/30 


13/21 




50 


16/138 


30/31 


31/35 




100 


19/173 


87/93 


37/40 


Discrete boundary value 


3 


2/22 


17/26 


6/13 




10 


2/22 


20/29 


16/36 


Discrete integral equation 


3 


2/22 


7/10 


7/10 




50 


2/22 


7/10 


8/10 




100 


2/22 


7/10 


8/10 




200 


2/20 


8/11 


9/11 




500 


3/32 


8/11 


9/11 


Broydcn tridiagonal 


3 


4/48 


46/78 


114/29 


Broydcn banded 


50 


5/64 


Fail 


Fail 




100 


4/52 


Fail 


Fail 


Linear - full rank 


2 


1/9 


2/4 


1/3 




50 


1/9 


2/4 


1/3 




500 


2/30 


2/4 


1/3 




1000 


2/29 


2/4 


1/3 


Linear - rank 1 


2 


1/9 


2/4 


2/10 




10 


1/9 


2/4999 


2/21 


Linear - rank 1 with zero columns and rows 


4 


1/9 


2/4 


2/11 
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4. Conclusions 

In this paper, we give a modified method with the exact line search for uncon- 
strained optimization problems. The modified method also works with the inexact 
line search. The direction d r is constantly descent direction at any point. The 
modified method converges to a stationary point Regardless of the initial point. 
The given table shows that the modified method is faster than the others, which 
requires less iterations and function evaluations. 

Finally, this method is a powerful tool for complex systems or when the objective 
function is given in an implicit form, which mean that direct calculations of the 
gradient and Hessian matrix are difficult or even impossible to calculate for this 
function. 

For further research, we should study the numerical experiments for large prac- 
tical problems in the future. 
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SOME EXTENSIONS OF SUFFICIENT CONDITIONS FOR 
UNIVALENCE OF AN INTEGRAL OPERATOR 

NAGAT. M.MUSTAFA AND MASLINA DARUS 



Abstract. In this paper, we introduce and study a general integral operator 
defined on the class of normalized analytic function in unit disc. This operator 
is motivated by many researchers. With this operator univalence conditions 
for the normalized analytic function are obtained. We also present a few 
conditions of univalence for the integral operator. 



1. Introduction 
Let A denote the class of functions / in the open unit disc 

U = {zeC:\z\ <1}, 

given by the normalized power series 

oo 

(l.l) f{z) = z + Y J d k z k (z<=U) , 

fc=2 

where a k is a complex number. 

Let S be the subclass of A consisting of univalent functions. For functions / given 

by (1.1) and 

oo 

g{z) = z + ^b k z k , (zeU). 
Let (/ * g) denote the Hadamard product (convolution) of / and g, defined by: 



(/ * 9) ( z ) = z + ^2 akbk 



z k . 



k=2 



Many authors studied the problem of integral operators, acting on functions in S. 
In this sense, the following result due to Ozaki and Nunokawa [11] is useful to study 
the univalence of integral operator for certain subclass of S. 

Theorem 1.1. Let f € A satisfy the following inequality: 
z 2 f(z) 



-1 



P{z) 
then the function f is univalent in U. 



< 1, for all (zeU), 



Key words and phrases. Analytic functions, Univalent functions, Integral operator, Schwarz 
Lemma. 

2010 AMS Math. Subject Classification. Primary 40A05, 40A25; Secondary 45G05. 
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For some real p with < p < 2, we define a subclass S(p) of A consisting of all 
functions / which satisfy 



< p, for all (z eU). 



m 



In [15], Singh has shown that if / E S(p), then / satisfies 
z 2 f(z) 



P{z) 



<p\z\, for all (z E U). 



For two functions f E A and g(z) — z + J^ bkZ k , (z E U). Let the function 

k=1 



ip{a, c; z) be given by 

oo 

(1.2) (p(a,c;z) = ^2 



fe=0 



^V+\ (*eJ7,c^0,-l,-2,-3,...), 
(c)fe 



(»)* 



where (a;)*, denotes the Pochhammcr symbol (or the shifted factorial) defined by: 
[ 1 for k = 0, 
[a;(x+l)(a; + 2)...(a; + fe-l) /or fc e N = {1,2,3, ••• }. 

Note that ip(a, 1; 2;) = z/(l — z) a and t/?(2, 1; z), is the Koebe function. 
For a function / E A we introduced the following differential operator . 

D°(X,a)f(z) = f(z), 

D\X,a)f(z) = (l-^-)(/(*)) + -^-*(/(*)) , ) 

A + a A + a 



(1.3) 



# 2 (A,a)/(z) = D{D\\a)f{z)), 



D m (X,a) = D{D m -\X,a)f{z)). 



If / € A then by using the Hadamard product (or convolution) (1.4) and (1.2) we 

have 



D m (X, a, a, c) = D m (X, a) * ip(a, c; z), 



then 

00 
(1.4) D m (X,a,a,c) = z + J2 



k=1 



X(k-l) + a + X \ m (o)fc-i fc 

X + a J (c) fc _i 



where a, A > 0, a + X ^ 0, to E N. 



By specializing the parameters to, a, A, a and c, one can obtain various opera- 
tors, which are special cases of D m (X, a, a, c) studied earlier by many authors listed 
as follows: 

• D°(X,a, a, c) = L(a,c), due to Carlson and Shaffer operator [1]. 



D°(X, a, (5 + 1, l)(/3 > —1) = i?" due to Ruscheweyh derivatives operator [13]. 
£) m (l, 0,0,0) = S" 1 due to Salagean derivatives operator [6]. 
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• D m (X, 1 — A, 0,0) = S r p due to the generalized Salagean derivative operator in- 
troduced by Al-Oboudi [5]. 

• D m (A, 1 — A, (3+1, 1) = D 1 ^ o due to the generalized Al-Shaqsi and Darus derivative 
operator [7]. 

• Z?" l (l,a,0, 0) due to the multiplier transformations studied by Flett [12]. 

• -D _1 (l,o:,0,0) due to the integral operator studied by Owa and Srivastava [10]. 

Here we introduce a new family of integral operator by using generalized differ- 
ential operator already defined above. 

For fi £ A, i = {1,2,3, • • • ,n}, n e NU {0} and 71,72,73, • • • , 7«,p € C, we define 
a family of integral operator *f> m (\, a, a, c; z) : A n — ► A n by 

f(u w «)-(>(r-n( g!^«M )^|', 

where a, A > 0, a + A ^ 0, m € N. and D m (\,a,a,c)f(z) defined by(1.4) which 

generalizes certain integral operators as follow: 

(1) a = c = 0, a = —A + 1, 7$ = — , p = 1, we obtain J(/i, .../ m ) Bulut[9]. 

(2) ?n = 0, a = c = 0, 7^ = j^tj, p = n(pc — 1) + 1, we obtain -F n . Q (.z) Breaz [2]. 

(3) m = 0, a = c = 0, 7, = — , p = 1, we obtain .F Q (.z) Breaz and Breaz [3] . 

To discuss our problems, we have to recall here the following results. 

Lemma 1.2. [16]Lei the function f be regular in the disk 

U R = {zeC:\z\< R}, 

with |/(2)| < M for fixed M. If f has one zero with multiplicity order bigger than 
m for z — 0, then 

\f(z)\<^\zr,(zeU R ). 

The equality can hold only if 

f{z) = e ie { — )z m , 
where 9 is constant. 



Lemma 1.3. [8] Let f E A, and (3 be a complex number with 5ft(/3) > 0. If f 
satisfies 



1 - \z\ m ^ 



H(/3) 

then for all z G U the integral operator 



zf'(z) 



<1, 



F (z) = \ a I t^fitfdt 



is in the class S. 
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Lemma 1.4. [14] Let f e A and ftceC where *R(/3) > and (\c\ < 1, c ^ -1). 



cN 2/3 + (i-N 2/3 )^ W 



/?/'(*) 



<1, 



for all z G [/, £/ien £/ie function 



is analytic and univalent in U. 



/ t<^f\t)dt 



Now, we shall use the same method given by Breaz and Ozlem Giiney (2008) we 
prove the following results. 

2. Main results 
Theorem 2.1. Let M > 1, let a family D m (X,a,a 1 c)f i (t) e S{p),i = {1, ...n}, 
|D m (A,a,o,c)/ i («)| < M sucft ttat V\£S£ a ^]f { { $ - l\<p\z\, Vz e U. Lf 

" (l-p)M+l 

«(p) > 2^ uTj ' P>7»eC, 



(2.1) 



1 1 

™d |c| < i- s? 7TEn( p+1 ) M + 1 ' 



iften i/ie family ^ m (a, c, a, A, 7,; z) belong to S. 



Proof. Since 

D m (X,a,a,c)f(z) 



i+E 



fe=2 



A(fc - 1) + a + A^ m (a) fe _i fe _j 



X + a 



(c) 



fc-i 



Let 



1 if z = 0. 



z / r~)m 



m - f (< 



)ti •••( )t» ) ,//. 



D™(A 1 a 1 a 1 c)A(z) 1 ^ ro (A,a,a,c)/ n (z) v 
F(z) = ( )^---( )' 



therefore 



zF"(z) 



F'(z) 

(2.2) 

From (2.2), we have 



E 



1 fz{D m {X,a,a,c)fi{z)y 



-7, V r> m (A,a,a,c)/i^) 



-1 
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zF"{z) 



F'(z) 



< 



E 



1 ( z{D m (X,a,a,c)fi{z))' 



|7i| V D m {X,a,a,c)fi{z) 



n 



? = 1 



z 2 (D m (X,a,a,c)f t (z)Y 



(£>™(A,a,a,c)/^)) 2 



|(J m (A, a , a ,c)/ t (z))| 



(2.3) 



From the hypothesis , we have \D m (X, a, a, c)fi(z)\ < M, z € U,i = {1, ...,n}, then 
by lemma 1.2, we obtain that: 

|D m (A,a,a,c).ft(z)|<M|4 
We apply this result in inequality (2.3) then we obtain: 



zF"{z) 



F'(z) 



< 



< 



n , 



z 2 {D m {X,a,a,c)f t {z))' 



(D^(X,a,a,c)Mz)y 



Ei 



1 . f.z 2 (D m (X,a 1 a,c)f l (z)Y 



^VV (^ m (A,a, a ,c)/ 2 (z))2 

n 1 

= Ei-i(p M i 2 i + M+1 ) 



M + l 



-1|M + M + 1 



7* 



n 1 
< X)IH((P+1)M+1). 



We have 






ci^+ a - \z\*)±±h z ™y: c }$f ~ 1) 



* |c|+ ^' 



^P~{li D m (X,a,a,c)fi(z) 
z 2 (D m (X,a,a,c)fi(z)y 



(D^(X,a,a,c)Mz)Y 



\(D m (X,a,a,c)fi(z))\ | 



We obtain 



~p>(~\ 1 ™ 1 



So, from (2.1) we have 



izi 2p + ci - bi 2 n zf — 

11 +l ' ' J HpF(*) 



< 1. 



Hence we complete the proof. 



□ 



Remark 2.1. If we set a = c = m = in Theorem 2.1, then we have result in [4]. 
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Corollary 2.2. Let M > 1, let a family D m (X,a,a,c)f t (t) € S(p),i = {l,...,n}, 
|£> m (A,a,o,c)/ i (z)| < M, such that l ^Z^Cw^S' ~ l \^M, Vz€U - V 
<«/ x "(1 -p)M+ 1 



and |c| < 1 — 



1 /(n(p+l)M+l) 



K(/t>) \ |7l 

i/ien i/ie family ^ m (a, c, a, A, 7$; z) belong to S. 

Proof. In Theorem 2.4, we consider 71 = 72 = ■■■"f n = 7- 



a 



Corollary 2.3. Let the family D m (\,a,a,c)f t (t) € S'(p),i = {l,...,n}, |-D m (A, a, a, c)/,(z)| < 

1 ™ch that \ % D s ( trMhi$ - !i ^ *n> Vz e ^ 7 / 

" 77+2 

1 n 1 

and |c| < l- fi 7T^ n ( !, + 2 '' 

iften i/ie family \I/ m (a, c, a, A, 7$; z) belong to S . 

Proof. In Theorem 2.1, we consider M = 1. D 



Theorem 2.4. Lei a family D m (X, a, a, c)/,(i) e A, |£> m (A,a, a, c)/ 4 (»| < M, Vi 
such that \ z J 2 m^^MW - 1| < 1, V* € 17. // 



(£>'"(A,a,o,c))/ i ( Z )) 



5R(/7) > £ 



i=l 



(2M + 1) 



M> 1, ft 7,€C, 



(2.4) 



and |c| < 1 



1 ^ (2M + 1) 



E 



»(p) tt N 
iften i/\e family ^ a ' ,p (m, q, A, 7$; z) belong to S. 



Proof. We know from the proof of Theorem 2.1 that 



zF"(z) 
F'{z) 



y l_ ( z(D m (\,a,a,c)f t (z)y _ ( 



^ 7l V D m {X,a,a,c)f t (z) 



Because 



c|z|2p + (1 _ |z|2p) _l_ f< l ( Way)M _ 1} 



3?P tt ^ 
I ^ : I , / z z (D m (X, a, a, c)/ t (z))' |(J m (A, a, a, c)/,(z))| + ., 



|C|+ |P|§VV (I> m (A ) a,o,c)/ i («)) 



Where 



rZ 2 (£> m (A,a,a,c)/ i (z)) / 

1 (L>"(A,a,a,c).A(z)) 2 



< 2, and |(D m (A,a,a,c)/ l (z))| < M\z\, 
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so 



\z\ 2p + (l- \z\ 2p ) zF '^ 
11 +( ' ' >^ P F(z) 1 






using (2.4) we have 



\z\ 2p +(l - \z\ 2p ) ZF '^ 
11 +[ ' ' 'npF(z) 



< 1. 



Applying Lemma 1.4, we obtain the family ^f a ' ■ p {m 1 q, A, 7$; z) belong to 5. □ 



Theorem 2.5. Lei M > 1, Zei a family D m (\, a, a, c)/,(t) e S(p),i = {l,..n}, 

| Z 2 (£> m (A,a,a, C )). 
I (ZJ">(A,a,a,c))/i 

(l+p)M+l 



|D™(A,a,a,c)/;(;z)| < M such that l ^JX^)/!^' ~ l \^M, V* € 17. 7/ 



R(/J) > X! 



iften i/ie family \E ,m (a, c, a, A, 7$; z) belong to S. 



p,ii e L, 



Proof. We know from the proof of Theorem 2.1 that 



zF"(z) 



F'(z) 



< 



n 1 

E 1 

±?7* 



z(D m (A,a,a,c))' 



- 1 



So, by the imposed conditions, we find 



1 - |z| 2 *W 
H(/3) 



«F"(z) 



F'(z) 



< 



< 



< 



< 



E^ 



! _ | z |2K(/3) 

1 _ \ z fR(P) " 1 

1 _ |^|2»(ff) " ! 



D m (A,a,a,c) 

z(/J m (A, a ,a,c)/ 4 (z))' 



L> m (A,a,a,c)/ 4 (z) 

^(^(A.a.o.cJ/iW)' 



(/^(A.a.a.c)/^*)) 2 

z 2 (L> m (A,a,a,c).ft(z))' 



£> m (A,a,o,c)/ j («) 



(^(A.a.a.c)/^)) 2 



- 1 



M + M + l 



n 

W E-((1 + ")M + 1) 



7* 



< 1. 



By applying Lemma 1.3, we prove that \& a ' ' p (m, q, A, 7$; 2) E S. 



□ 



Corollary 2.6. Lei M > 1, Zei a family D m (X,a,a,c)f l (t) € 5(p),« = {l,...,n}, 
|/J m (A,a,a,c)/ a (z)| < M such that 1 %^^^' - 1| <pN, Vz G [/. // 



then the family \& m (a, c, a, A, 7$; z) belong to S. 

Proof. In Theorem 2.5, we consider 71 = 72 = ...7™ = 7- 



a 
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Corollary 2.7. Let a family D m (A, a, a, c) /,(i) G S(p),i = {1, ...,n}, \D m (\, a, a, c)fi(z)\ < 
1 ^ch that l ^i^ftffl - 1| < pM, Vz G 17. // 

" 9-4- 

tften the family \I/ m (a, c, a, A, 7$; z) belong to S. 

Proof. In Theorem 2.5, we consider M = 1. □ 
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PERFORMANCE EVALUATION OF OBJECT CLUSTERING USING 
TRADITIONAL AND FUZZY LOGIC ALGORITHMS 

NAZEKAL-ESSA AND MOHAMED NOUR 

ABSTRACT. This work analyzes three object clustering algorithms mainly: the k-means, fuzzy c-means 
(FCM), and kernel fuzzy c-means (KFCM). The k-means algorithm partitions a dataset into hard clusters. 
Both FCM and KFCM are based on fuzzy logic and they return a membership degree of each object to all 
clusters. The algorithms are implemented and run on two different datasets. It is shown that the KFCM 
algorithm can achieve better results than the other two algorithms. The FCM is slightly better than the k- 
means algorithm. Moreover, the clustering time of each algorithm is different from the others. The clustering 
time of KFCM was larger than both the k-means and FCM. Clustering using the k-means has the smallest 
clustering time. The overall performance is affected by the dimensionality value, fuzziness parameter, 
number of objects, iterations, clusters, and the number of updating operations for both cluster centroids and 
membership values. 



1. INTRODUCTION 

Clustering algorithms play an important role in the field of data analysis. Such algorithms partition 
an object set into subsets or clusters where the objects in each cluster share some common properties, 
i.e. the objects belonging to the same cluster are as similar as possible whereas objects belonging to 
different clusters are dissimilar as possible. 

There are several approaches for conducting the clustering process. An example of such 
approaches is that one based on fuzzy clustering. Fuzzy clustering in contrast to the usual methods 
does not provide hard clusters, but return a degree of membership of each object to all clusters. The 
interpretation of these degrees is then left to the user that can apply some kind of threshold to generate 
hard clusters or use these soft degrees directly [1]. The necessity of fuzzy clustering lies in the reality 
that a pattern could be assigned to different classes and/or categories. In fuzzy clustering, a result is 
represented by degrees of membership of every pattern to the established classes [2]. 

There are several types of fuzzy clustering. This includes those methods based on fuzzy relations, 
objective functions, adaptive resonance theory, competitive learning, and others. Several research 
efforts were done in the area of fuzzy clustering. Examples of such efforts are briefly mentioned as 
folio ws:- 

[3] mentioned that document clustering has been widely applied in the field of information retrieval for 
improving search and retrieval efficiency. The authors proposed a scalable fuzzy algorithm for 
document clustering. The algorithm discovers relationships between information resources based on 
their textual contents. The algorithm addresses the problem of defining a suitable number of clusters 
for appropriately capturing all the topics of the knowledge domain. 

[4] presented a sample weighting clustering algorithm based on K-means and fuzzy C-means 
approaches. The algorithm uses academic documents as the clustering objects. Experiments show that 
the proposed algorithm is an effective solution to improve the performance of document clustering. 

[5] described an algorithm for building fuzzy hierarchies. The author presented an approach that 
mainly follows a bottom-up strategy, and described the functions needed to operate with fuzzy 
attributes. The author presented an application example for that approach. 
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[6] presented a way to use fuzzy clustering for generating fuzzy rule bases in the implementation 
of an intelligent agent. That agent interacts with human for diagnosis establishment: the medical 
diagnostics system. The results were compared with the results in some subtests of the Aachen Aphasia 
Test (AAT). 

[7] developed an interactive content -based image retrieval (CBIR) system that allows searching 
and retrieving images from databases. Based on the fuzzy-c-means clustering algorithm, the CBIR 
system fuses color and texture features image segmentation. A database consisting of skin cancer 
imagery is used to demonstrate the applicability of such CBIR system. 

[8] introduced a fuzzy-based distributed clustering algorithm to cluster distributed datasets 
without necessarily downloading all the data into a single site. The algorithm is compared against two 
existing distributed clustering algorithms where all data are merged into a single data source and 
clustered. The experiments confirmed good performance of the proposed algorithm. 

The organization of this paper will be as follows: Section 2 addresses; in addition to the work 
objective; the main difference between crisp and fuzzy clustering. Sections 3, 4, and 5 present the 
adopted algorithms respectively. The implementation work is presented in Section 6 while Section 7 
discusses the results. Finally, Section 8 concludes the whole work. 

2. OBJECT CLUSTERING BASED ON CRISP CLUSTERING AND FUZZY CLUSTERING 

Regarding the cluster analysis and/or cluster algorithms, there are two main approaches: crisp 
clustering and fuzzy clustering. In many real cases of crisp clustering, the boundaries between clusters 
cannot be clearly defined, i.e. some objects may belong to more than one cluster. In such cases, the 
fuzzy clustering method provides a better method to cluster such objects. Membership degrees between 
zero and one are used in fuzzy clustering instead of crisp assignments of those object clusters. 

This work aims at presenting an analysis and investigation of three algorithms for clustering 
objects. The performance of such algorithms is also evaluated. The adopted algorithms are: the K- 
means algorithm, the fuzzy c-means algorithm, and the kernel fuzzy c-means algorithm. The steps of 
such algorithms are briefly mentioned in the following sections. 



3. CLUSTERING OF OBJECTS USING THE K-MEANS ALGORITHM 

The K-means algorithm provides an easy method for clustering objects or data set. For a given 
dataset of n data points (or objects) x u x 2 , ..,.x n each is in R d , the k-means algorithm classifies such 
points into k-clusters. Initially, it is easy to define k centroids; one for each cluster. Such centroids are 
better to be chosen as much as possible far away from each other. Each point in the dataset is 
associated to the nearest centroid. K new centroids are recalculated resulting from the previous step. 
After that a new binding has to be done between the same dataset points and the new centroids. The k- 
means algorithm iteratively updates these centroids to decrease the objective function shown in 
equation (3.1). The algorithm always converges to a local minimum. As a result, the k centroids 
change their location step by step until no more changes are done. .i.e. the algorithm updates the 
cluster centroids till local minimum is found. Moreover, the objective function J which the algorithm 
aims to minimize is a square error function as follows: 



'-Zm2L 



xjf'-vj 



169 



where 



■P—i* 



N. AL-ESSA AND M. NOUR 



is a chosen distance measure between a data point x J and the cluster center v 



i.e the objective function involves the distance of the n data points from their respective cluster centers. 
In other words, some researchers use the similarity measure instead of using the distance measure 
mentioned in equation (3.1). The similarity measure between any two objects or data elements x t , and 
Xj (i.e. sim(x„ xj)) is the cos value between such elements. In fact, that measure is preferred. The 
generalized pseudo code for the k-means algorithm can be written as shown in Figure 1 . The algorithm 
was taken from [10, 14]. 

Algorithm: The k-means algorithm 

Input: 1. A set of objects or data points with d-dimensional each x h i=l, 2, ...n 

2. The number of clusters k 
Output: k clusters of data points 
Steps: 

1 . Generate k centroids Vj^ V 2 , ,V^ by randomly choosing k data points from X 

2. REPEAT UNTIL there is no change in clusters between two consecutive iterations 
FOR each data point X j in X 

FOR;'=l tok 

Sim( Vj, Xj ) = Find cos similarity between X^ and Vj 

ENDFOR 

Assign X i to cluster j which Sim( Vt, x£ is maximum 

ENDFOR 

Update the centroid for each cluster 
END LOOP 
END K-means Algorithm 

Figure 1 : The Generalized Pseudo Code for the K-Means Algorithm 



CLUSTERING OF OBJECTS USING THE FUZZY C-MEANS ALGORITHM 



Clustering methods can be considered as either hard (crisp) or fuzzy depending on whether a 
pattern data belongs exclusively to a single cluster or to several clusters with different degrees. In hard 
clustering, a membership value of zero or one is assigned to each pattern data (feature vector), whereas 
in fuzzy clustering a value between zero and one is assigned to each pattern by a membership function. 
The fuzzy c-means (FCM) is one of the fuzzy algorithms that can be used in a wide variety of 
engineering and scientific disciplines. The FCM is a clustering method that allows one piece of data to 
belong to two or more clusters. The FCM aims at minimizing the following objective function: 

C4-D J m=2Li=Xj=lH 



in 
U 



X i ~ V J 



I <m < oo 



where m is any real number greater than one, fig is the degree of membership of Xj in the clustery, X^ is 
the i th of d-dimensional measured data, Vs is the d-dimensional center of the cluster, n is the number of 
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data points, c is the number of clusters, and II* II is any norm expressing the similarity between any 
measured data and the center. 

The clustering process using the FCM algorithm can take place through an iterative optimization 
of the objective function shown in equation (4.1). This is done by updating the membership ^;j- and 

the cluster centers Vj as follows:- 
(4.2) ftu = 



■ J k=±\ 



(4-3) Vj = 

The iteration stops when 



tlk-^llj 



™ 



max i} - [\ii.. - /i.j. |j < s 



where £ is a termination criterion between zero and one. Also k is the iteration steps. 

Figure 2 briefly shows the main steps of the fuzzy c-means algorithm (FCM). The algorithm was 
taken from [1,2, 5, 9]. 

Algorithm: The Fuzzy C -Means Algorithm 
Input: 

1 . n data elements x t — 1, 2, . . .n ; each is of d-dimensional measured data 

2. The number of clusters c 
Output: 

C clusters with data elements; each with memberships values to the clusters 
Steps: 

1 . Initialize £/=[/*»/] matrix w(0) U { - * 

2. At k-step, calculate the center vectors V'- kj = [vj ] with U'- k ^> 



3. Update tfW,tf(M*> 






Pu 



ij: = l 



Fi ~ ^j 



m-1 



4. If IF ||f/ i; ' r+:L) - f/ (t5 || < £: then stop; otherwise return to step 2. 

//* £ is a termination criterion between zero and one and k is the iteration step. *// 
End of the Fuzzy c-means Algorithm 

Figure 2: The Main Steps of the Fuzzy C-Means Algorithm 
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5. CLUSTERING OF OBJECTS USING THE KERNEL FUZZY C-MEANS ALGORITHM 

The kernel fuzzy c-means approach (KFCM) is mainly based on the fuzzy c-means algorithm. 
That approach tries to minimize the following objective function: 

(5.i) f m {u,V) ^EitES^S ll* ft - VilF + E^j E£ =1 (i- /O m 

where <?j are suitable positive numbers, m is the quantity controlling clustering fuzziness and it is 
greater than one as mentioned before. Also, 2i=iMik = 1> ^i s tne set of cluster centers or prototypes ( 
l?i £ i? p ). The function J m should be minimized. 

The first part in equation (5.1) needs to be as low as possible; whereas the second part forces Hi k 
to be as large as possible. Moreover, 3 is better to be chosen as 

(5-2) Bi-K _ 



I- 



L^ 



Typically, K is chosen to be 1 . The cluster centroids or prototypes can be updated like those used in the 
fuzzy c-means, but the memberships of this approach can be updated as follows: 

(5.3) t± ik = 



■ - ■ 
1+ 



i 



i | W "iiiy 11 -*) 



Figure 3 shows the main steps of KFCM. The algorithm was taken from [11]. 

Algorithm: The Kernel Fuzzy C-Means Algorithm 

Input: 

1. n data elements x, = 1, 2, ...n ; each is of d-dimensional measured data 

2. The number of clusters c 
Output: 

C clusters with data elements; each with membership values to the clusters 
Steps: 

1 . Fix c, t ma3! , m> 1 , and S >0 

2. Initialize /ij j; 

3. Estimate e?j using equation (5.2) 

4. FOR t =1,2, t max ,DO 

Update all cluster centers or prototypes V i with equation (4.3) 

Update all memberships ft ik with equation (5.3) 

Compute E e =max i/k \^ k ~ ^f^ 1 ] , IF E* <,?STOP; ELSEt=t+l 

End of the Steps of the Kernel Fuzzy c-means Algorithm 

Figure 3: The Main Steps of the Kernel Fuzzy C-Means Algorithm 
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6. IMPLEMENTATION WORK 

The algorithms have been implemented and tested with two benchmark numeric datasets. The datasets 
are: SPECT-heart and liver-disorder datasets. Such datasets were taken from the UCI machine 
learning data repository. The main features and/or characteristics of the chosen datasets are briefly 
shown in Table 1 [13]. 

Table 1 : The Chosen Datasets as Test-beds 



Dataset Name 


No. of Attributes 


No. of Instances 


Area 


No. of 
Web Hits 


Date Donated 


SPECT Heart 


22 


267 


Life 


51564 


2001-10-01 


Liver Disorder 


7 


345 


Life 


20782 


1990-05-15 



The experiments were run for partitioning each dataset into a set of clusters mainly: two, four, six, and 
eight clusters respectively. The results were reported for such numbers for the k-means algorithm. For 
the fuzzy c-means and kernel fuzzy c-means algorithms; the reported results were only for two and 
four clusters for the space limitation for writing these pages. 

6. 1 APPLYING THE K-MEANS ALGORITHM 

As mentioned before, the k-means algorithm is based on the idea that a center point can represent a 
cluster. Also, the Cos measure is used to compute the similarity between any two data points x-, and Xj 
[12, 14]. 



(6.1) 



Cos (x h Xj) = (xj . Xj) I II Xj II II Xj II 



Where '.' indicates the vector dot product and II x II is the length of vector x. 

It is assumed that a dataset S of data items and their corresponding vector representation, the centroid 

vector v can be defined as: 



(6.2) 



v=(i/isi)E? rf *i 



The similarity between a centroid vector v and any data item x is calculated as in equation (6.3). Also, 
the similarity between any two centroid vectors tf; and Vt is calculated as in equation (6.4). 



(6.3) 
(6.4) 



Cos (x , v) = (x . v)l II x II II v II 

COS (V;, Vj) = (V;, Vj) I II V; II II Vj II 



Moreover, two measures can be used to evaluate the quality of clusters mainly the intra-cluster 
similarity (intra-CS), and the inter-cluster similarity (inter-CS). Such measures are written as follows: - 

(6.5) Intra-CS(/') = ( 1 / \S\) 2 xcS Cos (x, vj) 

The average over all the clusters is 
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(6.6) 



Avglntra-CS = (1 IK) ^ =1 lntraStrrt{i) 



Where k is the number of clusters. 

Regarding the inter-CS, the pairwise similarity between each pair of clusters is measured by computing 
the similarity between their centroids. Also, the average inter-cluster similarity (Avginter-CS) is 
measured by computing an average over all the distinct pairwise similarity between centroids. 



(6.7) Avginter-CS = {21 K (K-l)) I 1 f={;I 1 j= i + 1 SimJ[y i ,Vj) 

The obtained results after running the k-means algorithm on the chosen datasets are shown in Figure 4. 




K-means, Liver Data 
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Figure 4a: Average Similarity for K-means Figure 4b: No. of Objects for Two Clusters 
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Figure 4c: Centroids of Two Clusters 



Figure 4d: No. of Objects for Four Clusters 



K-means, Liver Data 



0.3 

S 0.25 

-cl 5 0.2 

-a 2 0.15 

C3 I 0.1 
_c4 0.05 






•-■- 



Figure 4e: Centroids of Four Clusters 
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Figure 4f: No. of Objects for Six Clusters 
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K-means, Liver Data 
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Figure 4g: Centroids of Six Clusters 



K-means, Liver Data 
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Figure 4h: No. of Objects for Eight Clusters 
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Figure 4i: Centroids of Eight Clusters 
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K-means, Heart Data 




Figure 4j: Average Similarity for K-Means 
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Figure 4k: No. of Objects for Two Clusters Figure 41: Centroids of Two Clusters 
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Figure 4m: No. of Objects for Four Clusters Figure 4n: Centroids of Four Clusters 
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Figure 4o: No. of Objects for Six Objects 



Figure 4p: Centroids of Six Clusters 



K-means, Heart Data 




K-means, Heart Data 




Figure 4q: No. of Objects for Eight Clusters Figure 4r: Centroids of Eight Clusters 
6.2 APPLYING THE FUZZY C-MEANS ALGORITHM 



The experiments for this algorithm were run for different values of the fuzziness parameter (m). 
The adopted values of that m were: 3, 2.5, 2, and 1.5 respectively. For each value of m; a set of 
iterations were run. The number of objects and the centroids of the fuzzy clusters were reported. 
Moreover, both the centroids of the fuzzy clusters and the membership values are iteratively 
recomputed and updated. The iterations stop after achieving the stopping criterion. Different values of 
m were tested; as mentioned above; to see the effectiveness and role of the fuzziness parameter on the 
obtained clusters. This is important for evaluating the performance of the algorithm. Figure 5 presents 
the cluster centroid values versus the iteration number for the adopted datasets. 
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Figure 5f: Centroids of Two Clusters 
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Figure 5g: Centroids of Four Clusters 
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6.3 APPLYING THE KERNEL FUZZY C-MEANS ALGORITHM 

The experiments implemented for the fuzzy c-means algorithm are also operated for the kernel 
fuzzy c-means algorithm. The centroids of the fuzzy clusters and the membership values are also 
iteratively updated. As the centroid and membership values change, the objective function also 
changes. For the same experiments, the objective function in this algorithm is different from the 
corresponding values in the FCM due to the added part in the objective function. Such part forces the 
membership values to be as large as possible. Figure 6 shows the results of applying this algorithm on 
the adopted datasets mentioned before. 
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7. DISCUSSION OF RESULTS 

The adopted algorithms were iteratively run on the two datasets mentioned above. For the K- 
means algorithm, the average intra-cluster similarity (Intra-CS) and the inter-cluster similarity (Inter- 
CS) change by changing the number of clusters. As the number of clusters increases the intra-cs 
increases while the inter-cs decreases. The intra-cs is always greater than the inter-cs for the same 
number of clusters. Each iteration the K-means is run, the number of objects in each cluster and their 
cluster centroids are different. This is due to the massive movement of objects among clusters in the 
initial iterations. That movement has been stopped in the last iterations. This had been occurred for the 
two datasets for two, four, six, and eight clusters respectively. Running the algorithm on the two 
different datasets for the same number of clusters gives different values of cluster centroids and 
number of objects. This means that the size, nature, and characterization of each dataset are significant. 
For both the fuzzy c-mean (FCM) and kernel fuzzy c-mean (KFCM) algorithms, the fuzziness 
parameter has a significant effect on the fuzzy clustering. The algorithms were iteratively run for four 
different values of the fuzziness parameter. The quality of the fuzzy clusters is improved by decreasing 
the fuzziness value, i.e. For the higher fuzziness values, the fuzzy cluster centroids became more closer 
to each other or sometimes equal. This causes the membership values of an object to all clusters to be 
approximately the same. This causes bad clustering quality. The best fuzzy clusters were for the small 
values of the fuzziness parameter near to one. This was valid for the two algorithms for the chosen 
datasets for two and four clusters. Moreover, the rate of change in cluster centroids was large for the 
initial iterations and very small at the last iteration. For the same iteration, the same fuzziness value, 
the same dataset, the KFCM algorithm outperforms the FCM algorithm. This is clear in the fuzzy 
cluster centroids which show higher differences than those corresponding in the FCM algorithm. As a 
result, the membership values were higher for the KFCM which means better quality. The reason for 
this refers to that added part in calculating the objective function in the KFCM algorithm. The fuzzy 
clustering time in the KFCM algorithm was higher than its corresponding values of the FCM algorithm 
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and this is clear from the extra iterations consumed in running the KFCM algorithm. Regarding the 
time complexity, the clustering time for the three algorithms was different. The highest clustering time 
was for the KFCM algorithm and the smallest one was for the K-means algorithm. 

Finally, the size of the chosen dataset is significant, i.e. For each dataset, the number of objects 
with the properties of each plays a vital role in the clustering time. Such time is always affected by 
several factors mainly: the number of iterations, the number of clusters, the number of objects, the 
dimensionality of each object, and the adopted algorithm. 

8. CONCLUSION 

This work presents an investigation of three clustering algorithms. As the number of clusters 
increases in the K-means algorithm, the intra-cluster similarity increases while the inter-cluster 
similarity decreases. The intra-cluster similarity is always greater than the inter-cluster similarity. Each 
time the algorithm is run, the objects in clusters were different in several iterations till the movement of 
objects among clusters has been stopped. 

For both the fuzzy c-means and kernel fuzzy c-means, the fuzziness parameter has a significant 
effect on the clustering process. The best clusters occurred for small values of the fuzziness parameter 
near to one. This is due to the different centroid values of fuzzy clusters. Moreover, the differences 
among cluster centroids in the kernel fuzzy c-means were greater than their corresponding values in 
the fuzzy c-means. This had led to better quality clustering for the kernel fuzzy c-means. The kernel 
fuzzy c-means algorithm outperforms the other two algorithms. This is due to the added part in the 
objective function which forces the membership values to be as large as possible. Moreover, the 
clustering time was highest for that algorithm and lowest for the k-means algorithm. Finally, the 
clustering time increases in all algorithms by increasing the number of clusters. 
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NINE POINT MULTISTEP METHODS FOR LINEAR 
TRANSPORT EQUATION. 

PARIA SATTARI SHAJARI AND KARIM IVAZ 



Abstract. In this paper we construct a family of multistep methods for the 
linear advection equation, based on nine points of a rectangle form grid. Square 
polynomials arc used for this purpose. Numerical examples show the perfor- 
mance of the different methods according to the choice of the parameters. 



1. Introduction 

In this paper wc generalize one step methods introduced in [1] for the linear 
hyperbolic scalar equation: 

(1.1) L(u) — u t + cu x = 

where c is a positive constant. Also we consider linear advection-diffusion equation 
of the form: 

(1.2) L(u) = u t + cu x - vu xx = 

where v > is the diffusion coefficient. Initial conditions and inflow boundary 
conditions are provided in the usual way, so that the solution of (1.1) turns out to 
be a traveling wave preserving its shape. However, the numerical method we are 
going to present here for the simple linear case will be significant enough to enable 
many interesting conclusions to be drawn. 

The general theory on hyperbolic equations and conservation laws has already 
generated an enormous amount of literature (see for instance [2] , [7] ) . The relevance 
of advection-dominated problems is also testified to by a number of recent papers 
dealing with a variety of approximating methods and numerical schemes [6]-[10]. 

We use nine points of a rectangle form grid with square polynomials to construct 
multistep methods for solving Eq. (1.1). 

2. Nine point multistep methods 

Throughout this paper we only consider finite-difference methods for solving 
equations (1.1) and (1.2) on t <G [0, T] and x E [0,L], based on nine points of a 
rectangle form grid (see figure 2). 

Thus, in the (x, t) plane we take a uniform grid of width h = Ax — L/M 
and a constant time-step At = T/N. We use the nine point (xj-i,tk-i), (xj,tk), 
...,(xj+i,tk+i) for constructing an interpolation of approximate solution. Hence, 
we introduce following Lagrange basis for any fixed j and k 

V = {l t (x)G m (t) G p {2 ^ : % = 0, 1, 2, m = 0, 1, 2} 



Key words and phrases. Linear transport equation, Finite difference methods, Multistep meth- 
ods, Advection-diffusion equation. 

2010 AMS Math. Subject Classification. 65N06. 
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(cCj_i,ife+i) (Xj,tk+l) (Xj+i,tk+l) 



(Xj-i,tk) (%j,tk) {Xj+i,tk) 

° collocation point (£, r) 

(Xj-l,tk-l) (Xj,tk-l) (Xj+i,tk-l) 

Figure 1. Nine points of a rectangle form grid 



where 



1 

2h 2 ' 
-1 



L o( x ) = 7^2 ( x ~ x i)( x ~ Xj+l), 



L i( x ) = T^( x ~ x j-i)( x - x i+i)i 



1 

2h 2 ' 
1 



L ^ X ) = ^2^ X ~ X J-l)( X ~ X j)> 



G o(t) = ^(t-tj)(t-t j+1 ), 



•1 

At?' 



Gi(t) = in5 (t-t j -i)(t-t j+ i) 



and 



G*(t) = W^(t-t j -i)(t-t j ). 



1 
2Ai 2 ' 
Therefore, for any p € p( 2 > 2 ), we have 

2 2 

p(x,t) = Y / Y,pjtr--i 1 L l (x)G m (t) 

where, to simplify the notation, we set p^ = p(Xj,tk)- For approximating solution 
of equation (1.2) in the space V, we substitute 

2 2 

Px (x,t) = E E P-tr-i'L^G^t), 

2 2 

Pxx (x,t) = E E P^l'^W^mW 

i=0 m—0 

and 

2 2 

p*(M)=EEi + .-i li .w G ™( f )- 

i=0 m=0 

in (1.2) and we obtain 

(2.1) 

2 2 

i?(x,i) = L(p) =J2H P'tr-'i 1 (M x )G' m {t) + cL[{x)G m (t) - vL'[{x)G m {t)) . 

i=0 m=0 

The residual term R should be zero in a same sense. Let (£, r) be a point in the 
rectangle of vertices (xj-i,tk-i), (xj+i,tfe_i), (zj+i^+i) and (xj_i,tfe + i) (sec 
figure 2). Then, the numerical scheme will be obtained by requiring the residual R 
to vanish at (£,t). Hence, we have 

(2.2) i?(£,r) = 0. 
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For simplifying the notations, we let 

s = £ - X 3-l 
and 

r = t -tk-i- 
Now we change coordinate by translating Xj-i and tk-i to the origin of coordinate. 
To do this we use 

Lo(0 =^2 (£ - Xj)(£ - x j+1 ) = ^(- h + s )(-2h + s), 
L'o(0 =^ ((£ - Xi) + (£- x j+1 )) = ^ (-3/1 + 2s), 



i 2 (0 = 



h 2 ' 

s(s — h) 



2h? ' 

(r-At)(r-2At) 
Go(T) = 2A1 2 ' 

(2r-3At) 
Go(T) " 2 At 2 ' 

-r(r-2At) 



Gi(r) = 
G 2 (r) = 
G' 2 (r) = 



At 2 
-(2r - 2 At) 

At 2 : 
r(r - At) 
2At 2 ' 
(2r - At) 



2At 2 
and introduce 

« tm := HOG'^t) + cLj(^)G m (T) - <(OG m (r) 

to obtain 

a 2PjH+a 12 p k j +1 + U22P k >Xl 

(2.3) = - a iPj_i - anPj - a2iPj + i 

- uoop^zl - aioPj' 1 - «20Pj+i 

for k = 1, ..., iV— 1. The system (2.3) shows a two steps method and can be solved if 
the initial steps were given. The initial steps can be found using boundary conditions 
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and a one step method. In this paper, we use a six-point stencil method introduced 
in [3] .The collocation parameters s and v for the introduced nine-point method are 
exactly the same parameters s and v used in the six-point stencil method. Now, 
suppose following initial conditions was given 



(2.4) 

(2.5) 

and the Neumann condition 



u(x,0) = f(x) 

u{0,t)=g(t) 



du, 



d.r 



= L=0. 



\x=L 



(2.6) 

This condition implies that 

(2.7) P°j=f{xj), for j = 

(2.8) P k a =g(tk), for k = 
and the Neumann type constraint 

(2-9) P k n-l=Pn- 

Equations (2.3)-(2.9) can be written in the matrix form 

(2.10) 
where 



M 

N 



HP 



k+l 



D k 



( (XXI «22 

a a2 Qi2 a 22 
a 2 ai2 «22 



(2.11) 



(2.12) 



H 



\ 







a Q2 &\2 a 22 
0-11 



V 



/ 



/ -aoiP§ - auPi - U21P2 - a ooP - a w p 1 - a 20 p 2 - a 02 g(t k+1 ) \ 



D h 



\ 



-OiOlPM-2 - a llPM-l - "21Pm - "OOPm-2 _ "IOPm-1 ~ a 20PM * 



J 



and P k+1 = [p^ +1 , . . . ,p^+ 1 ] T . To obtain the initial steps p 1 , we solve following 
equations (see [3] Eq. (3.6)) 

p) + A(p) +1 - 2p) + p)^) + B{p) +1 - p)^) = 

p? + C(p° 1+1 - 2p° +1 + p° j _ 1 ) + D(p° j+1 -p^) 



(2.13) 
with 



A 



(2.14) 



C = 



s 2 — 2r(cs + v) 
2~h 2 ' 

s 2 + 2(Ai - r)(cs + v) 



B 



D = 



2h ' 

-s - c(At - r) 



2h 2 " 2h 

Now the approximate solution can be obtained by solving (2.10) for k = 1, . . . , N. 
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2r 

1.8- 
1.6- 
1.4- 
1.2- 

1 - 
0.8- 
0.6- 
0.4- 
0.2- 

0- 



- exact solution 
approximate solution 



0.4 0.6 0.8 




Figure 2. Numerical solution of equation (1.1) with v = 0, and 
s = h, r = h/c 





0.2 0.4 0.6 0.8 1 



Figure 3. Numerical solution of equation (1.1) with v = 0, and 
s = h/2, r = h. 



3. Numerical examples 



In this section we discuss a series of numerical examples according to different 
choices of the collocation points. We deal with equation (1.1) for x > and c > 0, 
where the following discontinuous initial datum is considered: 



(3-1) 



u(x, 0) = Uq(x) 



1, for x = 0, 
0, for x > 0. 



187 



PARIA SATTARI SHAJARI AND KARIM IVAZ 



2r 
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- exact solution 
approximate solution 



- exact solution 
approximate solution 




Figure 4. Numerical solution of equation (1.1) with v = 0, and 
s = h/2, r = 3At/2. 



it=0.001,h=0.1 





Figure 5. Numerical solution of equation (1.1) with v = 0, and 
s = 3/i/4, r = 5At/3. 



(3.2) 



u(x, t) 



The boundary condition u(0,t) = l,t > 0, is imposed at the inflow boundary, so 
that the exact solution is: 

1, for x < ct, 

0, for x > ct. 

The discontinuity of the solution provides a challenging test for comparing the 
performances of different schemes. We fix the parameters in the following way: 
c = 1, and h = Ax = 0.01, while various regimes have been chosen for At, In figure 
(2)-(5) we present the main results of our experiments for two different choices of 
At. i.e.: At = h = 0.01 and At = 0.001, respectively. The dashed lines show the 
evolution of the exact solution at times t\~ = fc/5, with k = 1, . . . , 5. The plots are 
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slightly shifted upwards in order better to display the graphs. The solid lines show 
the corresponding approximations. 
We use the following methods 

Method 1: v = 0, and s = h, r = h/c. 

Method 2: v = 0, and s = h/2, r = h. 

Method 3: v = 0, and s = ft/2, r = 3At/2. 

Method 4: i/ = 0, and s = 3ft/4, r = 5At/3. 
to obtain some numerical experiments. The numerical results which are illus- 
trated in figures (2)-(5) show the efficiency of our methods. Figure (3) shows that 
the method 2 is not convergence when h — At — 0.01. But for smaller At (i.e. 
At = .001) it will be convergence. In comparison with [3], we see that those meth- 
ods have the same properties reported in [3]. Now the important question is how 
should we choose s and r such that the introduced method be convergence? An- 
other question is what is the best choice? We try to answer this questions in our 
next works. 
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COMPARING THE BOX-JENKINS MODELS BEFORE 
AND AFTER THE WAVELET FILTERING IN TERMS 
OF REDUCING THE ORDERS WITH APPLICATION 

QAIS MUSTAFA AND TAHA H.A.ALZUBAYDI 



Abstract. In this paper , the estimated linear models of Box-Jenkins has 
been compared from time series observations , before and after wavelet shrink- 
age filtering and then reducing the order of the estimated model from filtered 
observations (with preserving the accuracy and suitability of the estimated 
models) and re-compared with the estimated linear model of original obser- 
vations , depending on some statistical criteria through taking a practical 
application of time series and using statistical programs such as Statgraphics 
, NCSS and MATLAB. 

The results of the paper showed the efficiency of wavelet shrinkage filters 
in solving the noise problem and obtaining the efficient estimated models , 
and specifically the wavelet shrinkage filter (draey) with Soft threshold which 
estimated it's level using the Fixed Form method of filtered observations , and 
the possibility of obtaining linear models of the filterd observations with lower 
orders and higher efficiency compared with the corresponding estimated model 
of original observations 



1. Introduction 

Time series analysis has big importance in studying the behavior of different 
phenomena and it represents an important base for many fields like forecasting , 
therefore we see its applications had expanded for many fields ,for example applied 
researches , engineering sciences , medicine , physics , soil science , finance and 
economic... etc. And because the observations of time series are be affected by 
many unknown natural factors , therefore these observations contains ratio of noise 
which is defined as unwanted data and usually has small value and high frequencies 
and as a result for this it contaminates the real observations and cause difficulty 
in the process of analyzing the time series observations like diagnostic , estimation 
and forecasting. 

The process of reducing the noise or removed before analysis the time series is 
very important in order to obtain more accurate and reliable results when building 
models .The wavelet shrinkage technique consists of wavelets with threshold is a 
strong mathematical approach to remove most of the noise while retaining the 
maximum amount of energy data that represents the real observation. 

Wavelet transform have been used in many fields. It has been observed that 
the accuracy of the forecasting can be improved through using wavelet trans- 
forms. Rumaih M.and Mohammad A., in 2002[l]used Saudi stock index to show that 
wavelet transform is better than the other forecasting technique in predicting the 
de-noising of the financial time series , and this is done after making a comparison 



Key words and phrases. Box-Jenkins, time series, wavelet shrinkage, filters, Fixed Form. 
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with several forecasting models. Alwadi S. ,Mohd Tahir Ismail, Alkhahazaleh and 
Samsul ArifSn Addul Karim,in 2011 [2] used the wavelet transform to decompose 
the return Amman stock market in to a set of better behaved series data and ex- 
plained that after making a comparison framework, the wavelet ARIMA model is 
better than the ARIMA model of the original data and gives more accurate results 
and also gives data with more stable in variance , mean and no outliers. 

The basic idea of this paper is to see the method of the wavelet transforms in 
general and wavelet shrinkage in time series analysis in particular . Moreover ,the 
study try to show the advantages in modeling and forecasting through de-noising 
the series using wavelet shrinkage and try to get the ability of lowering the order 
of the estimated model using wavelet shrinkage. Batch of chemical process data[3] 
was used for analysis. 

Two different methods were considered here, as showed in Figure 1. In the first 
method, the batch of chemical process data is modeled using Box - Jenkins method. 
Then some forecasting criteria were computed. In second method the technique of 
wavelet shrinkage was used through filtering the time series data using a set of 
wavelet filters. The de-noised series are modeled ,as in the first method ,then the 
forecasting criteria were computed again. The forecasting criteria were evaluated 
and compared to those of the first method. 
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(a) First method 



Data 



De-noise 
Using wavelet shrinkage 
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Forecast 



(b) Second method 
Figure 1. Modeling Methods 

The remainder of this paper is structured as follows. In section 2 we review the 
simple brief introduction to wavelet transform. Section 3 gives time series de-noising 
using wavelet shrinkage .Section 4 deals with main results before and after reducing 
the order of the model using wavelet shrinkage and gives discussion. And finally in 
section 5 conclusions are presented. 

2. The Wavelet Transform 

The mathematical transformations are applied to a signal to get additional in- 
formation that is not found in the time domain representation of that signal. First 
we consider the Fourier transform , which decomposes signals in to sum of peri- 
odic bases of finite length and can transform the signal from time domain to the 
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frequency domain and vice versa. It is defined mathematically as follows: 

/oo 
x (t) .e-i wt dt 
-oo 

Where X(f) is the Fourier transform of the signal x(t) .The problem here is that 
the Fourier transform becomes non active for signal changing over time , because 
it provides for us the information of frequency content. In other words the x(f) Is 
not a function of time .For this reason the Fourier Transform extended to so called 
Short-Time Fourier transform(STi* 1 T).It is defined as: 



(2) 



STFT$ v) (t',f) = 



[x{t).w*(t-t)}.e-^ ft dt 



Where t is the shift factor , w(t) is the window function , and * is the complex 
conjugate. The STFT gives us a compromise of sorts between time and frequency 
information .The accuracy is limited by the size and shape of the window. For 
example , using many time intervals would give good time resolution but the very 
short time of each window would not give us good frequency resolution, specially for 
lower frequency signals [7]. The frequency component of a signal at a particular time 
instant cannot exactly be determined. This comes directly from the Heisenberg's 
uncertainty principle , which states that the momentum and position of moving 
particle cannot exactly be determined. 

This shortcoming was overcome by the development of the wavelet transforma- 
tion. Wavelet transforms allow us variable-size windows. We can use long time 
intervals for more precise low frequency information and shorter intervals (giving 
us more precise time information) for the higher frequencies. Figure 2 shows the 
partitioning of the time- frequency plane by different techniques [8]. 
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Figure 2. The partitioning of the time - frequency plane by different techniques 
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The wavelet transform is defined as follows: 

(3) *t(x,s) = J ' x(t).r T Jt)dt 

where: 

(4) 






Where s is the scale variable and r is the translation variable. When substituting 
this description in (2.3) gives the definition of the continuous wavelet transform 
CWT: 



(5) 



CWT+(t,8) = 



1 



x(t).ip 



dt 



As seen in the above equation ,the transformed signal is a function of two variables 
, r and s, the translation and scale parameters respectively. The translation r is 
proportional to time information and the scale s, is proportional to the inverse of the 
frequency information. To find the constituent wavelets of the signal , the coefficients 
should be multiplied by the relevant version of the mother wavelet ^[10]. 

Discrete wavelet transform DWT analyzes the signal at different frequency bands 
with different resolutions by decomposing the signal into a coarse approximation 
and detail information. DWT employs two sets of functions , called scaling func- 
tions and wavelet functions , which are associated with low pass h[n] and high pass 
g[n] filters . The original signal is passed through the both half band filters. After 
filtering half of the sample can be eliminated according to the Nyquist's rule (i.e.; 
half of the samples are redundant). The signal can therefore be sub-sampled by 2 
, simply by discarding every other sample. This process can be repeated for further 
decomposition [11]. 

Figure 3 represents the wavelet decomposition tree of DWT coefficients for three 
levels decomposition , where s is the original sequence , cA and cD are approxima- 
tion and details(i.e.; low pass and high pass analysis filters) [10]. 



i: 



L 



> 



CA. 



y 



C/K- 



1 



CO 



] 



ezO- 



c O 



Figure 3. Wavelet decomposition tree of the DWT coefficients for tnree levels 

Note that the original signal can be synthesized by the low and high frequencies 
sequences by assembly sequence for each assembly parts resulting from the previous 
parts (approximation parts and detailed) started from the last stage of analysis. 
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3. Time series analysis using wavelet shrinkage 

Statistical wavelet methods provide a powerful tool to recover the original signal 
from noisy observations under general assumptions , we refer the reader to [5] 
for more details. The generic methodology is called wavelet shrinkage or wavelet 
thresholding. The underlying model for the noisy series can be expressed as follows: 

(6) X(t) = s(t) + n(t) 

Where n(t) represents the independent and identically distributed random vari- 
ables with zero mean and finite variance. The objective is to suppress the noise part 
of the series x(t) and recover the clean part s(t) The procedure can be summarized 
as three main steps [5, 12] : 

1-The discrete wavelet transform DWT is computed from the data. 

2-Significantly large coefficients in the DWT are kept , others are shrinked. 

3-The inverse DWT is applied to the shrunken set of coefficients. 

The three steps above can be summarize in figure 4: 
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Figure 4. Wavelet shrinkage chart 

For each level , we will have a threshold . The Fixed Form threshold (i.e.; Universal 
threshold) technique is considered and is given by the following formula: 

(7) 5 (FT) = a(MAD) v / 2log(N) 

Where (N) is the number of wavelet coefficients in specified level , a(MAD) is the 
estimate of the noise standard deviation and can be obtained by applying a median 

N 
absolute deviation (MAD) estimator to the — wavelet coefficients at the first 

level of decomposition , incorporating a scale factor equal to (0.6745) [6]. After 
estimating the threshold of a specified level , wavelet coefficient of that level are 
either hard or soft threshold. Hard thresholding represents the keep or kill (i.e.; 
wavelet coefficient is less than the threshold it would be put to zero otherwise it stays 
without change) . The soft thrcsolding shrinks all non zero coefficients towards zero 
, which gives a smooth de-noising The soft threshold de-noising function formula 
can expressed as follows : 

dj,k, \dj,k\ > S, 
0, \dj, k \<0 

Where dj^ denotes the coefficient of transformation and 5 is the threshold.lt 
was shown in [4] that soft thresholding has smaller variance than hard thresholding 
, therefore here we only consider soft thresholding for modeling and forecasting the 
yields data of batch chemical in this paper. 

4. Main results and discussion 

In this section , an application was considered so as to show first the ability of 
wavelet shrinkage to reduce the noise from original data , and second to get a model 
with a lower order and higher efficiency compared with the original. Figure 5 shows 



(8) 



dj,k = 
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five levels multiresolution wavelet analysis using Haar wavelet for the yields from 
a batch chemical process for 65 consecutive observations , where s is the signal and 
it is the sum of its approximation and of its hne details , a§ is approximation at 
level 5 and ds, cLt, c?3, g?2, d\ are the details at level 5,4,3,2 and 1. 




Figure 5. Five multiresolution wavelet analysis using Haar wavelet for the yields from 
K batch chemical process 



Now , we will use the two different methods mentioned in the introduction as follows: 
Method (1): The yields of batch chemical process was modeled as ARMA(2, 0) , 
the parameters (p = 2,q = 0) were selected after careful modeling and fitting 
(Statgraphics software was used for modeling). The performance measures (i.e.; 
the forecasting criteria) used in the analysis are Root Mean Square Error (RAISE) 
, Mean Absolute Error (MAE) and Akaike's Information Criterion(AJC) are com- 
puted as the following [9,13,14]: 



(9) 



RMSE = 



TIU. 



(10) 



MAE = 



E£=i N 



(ii) 



AIC = lr 



2k 



Where a t represents the difference between actual and forecasted value , (k) is 
the number of estimated parameters of the model and (n) is the sample size. 
Method(2):The yields from a batch chemical process noise model was considered. 
The data was de-noised using wavelet shrinkage technique mentioned in section (3) 
(using MATLAB software , version 2008) with four different wavelet families . It 
is important here to say that after many experimental trials with many wavelets 
families , it was found that these wavelets better than others in terms of de-noising 
the batch data, and they are [7]: 
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1- Haar wavelet with five multiresolution levels. 

2- Daubechies wavelet of order (5) and five multiresolution levels. 

3- Coif lets wavelet of order (3) and five multiresolution levels. 

4- DiscreteMeyer (dmey) wavelet with five multiresolution levels. 

For each selected wavelet, the batch data was first analyzed for five multireso- 
lution levels and de-noised using the Fixed Form threshold . After de-noising, the 
series were modeled using Box- Jenkins method and forecasting criteria were com- 
puted and compared with those in the first method mentioned before. Table (1) 
shows the performance measures (i.e.; forecasting criteria) for ARMA(2, 0) model 
of the original and de-noised data using the Fixed Form thresholding. It is clear 
from the table that when the ARM A model was used , wavelet de-noising reduces 
forecasting errors depending on forecasting criteria. The reduction is in highest level 
when the batch data was Fixed Form thresholded using Discrete Meyer Wavelet 
(note from the table 1, the best reduction comparing with all other forecasting 
criteria used here). 

Filtered observations using wavelet shrinkage filters were modeled once again 
for each wavelet filter, but this time as ARMA(1,0) when rebuilding the model 
instead of ARMA(2, 0) after careful modeling and without effect on the suitable 
and accuracy of the model , then compared with the original model ARMA(2, 0). 
Table (2) shows the performance measures for ARMA(2, 0) model of the origi- 
nal and de-noised data when rebuilding ARMA(1,0) model ,then using the Fixed 
Form thresholding. It is clear from the table that when the ARM A model was 
used , wavelet de-noising reduces forecasting errors depending on forecasting crite- 
ria. Once again the reduction is in highest level when the batch data was Fixed 
Form thresholded using Discrete Meyer Wavelet (note from the table 2, the best 
reduction comparing with all other forecasting criteria used here). 

Table 1: RMSE , MAE and AIC for forecasting model of the batch data 
comparing with the de-noised data using Fixed Form thresholding. 



Method 


Kind 


RAISE 


MAE 


AIC 


First method (Box-Jeiildiis} 
Original data 


AR\iA(2,6) 


10.5934 


7.7S54 


4.S137 


Second method (Fixed Form) 
De-noised data 


Haar 


8.7989 


6J515 


4.1415 


DaiibecMes (5) 


82375 


6.0619 


4J097 


Coiflet (3) 


82192 


6.0618 


4.3053 


Discrete Meyer(djn^) 


7.9810 


5.9255 


42464 



Table 2: RMSE , MAE and AIC for forecasting model ARMA(2, 0) of the batch 
data comparing with the de-noised data as ARMA(1,0) using Fixed Form thresh- 
olding. 



196 



QAIS MUSTAFA AND TAHA HAALZUBAYDI 



Method 


Kind 


RMSE 


MAE 


AIC 


First method (Box-Jenkins) 
Original data 


ABMAQjB) 


10.5SS4 


7.7S54 


4.S137 


Second method (Fixed Form) 
De-noised data 
msARMA(l,0) 


Haa? 


S.S602 


S.55S5 


4.424S 


Daubechies (5) 


SJ269 


SJ3S3 


4J000 


Coijlst (i) 


S.2022 


6.1242 


4.2703 


Discrete Meyer(dm^) 


7.9555 


5,9805 


42092 



5. Conclusions 

1- More information could be obtained from a series When using Wavelet Shrink- 
age technique , specially the wavelet filter Discrete Meyer , forecasting errors can 
be reduced depending on some performance measures . 

2- The ability of lowering the order of the estimated model of batch data for 
filtered observations using Wavelet Shrinkage as a result of reducing the noise with 
keeping the efficiency and suitability of that model. 
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REDUCED BIAS OF THE MEAN FOR A HEAVY 
TAILED DISTRIBUTION 

RASSOUL ABDELAZIZ 

Abstract. For the estimation of the mean for a heavy tailed dis- 
tribution with index 1/2 < 7 < 1, The semi-parametric estimation 
of mean depends not only on the estimation of the tail index or 
extreme value index 7, the primary parameter of extreme events, 
but also on the adequate estimation of a scale first order parame- 
ter. Recently apart from new classes of reduced-bias estimators 
for 7 > 0, new classes of the scale first order parameter have been 
introduced in the literature. Their use in mean estimation en- 
ables us to introduce new classes of asymptotically unbiased mean 
estimators. The asymptotic distributional behavior of the pro- 
posed estimators of /x is derived, under a second-order framework, 
and their finite sample properties are also obtained through Monte 
Carlo simulation techniques. 



1. Introduction and preliminaries 

A model F is heavy-tailed whenever, the tail function F = 1 — F is 
regularly varying functions with a negative index of regular variation 
equal to — 1/7 (7 > 0) (F G 72V_i/ 7 ), or whenever the quantile function 

U(t) = F*-(l - l/t) = inf{:r : F(x) > 1 - 1/t}, 

where F^~ represented the inverse generalized of the df F, is of regular 
varying with an index 7 (U G 72. V 7 ), i.e., for every x > 



x 7 , 



ms v 1-F(tx) 1Av ,. U(te) 

(1.1) hm Vt - = x~ lh <^> hm \ / 

v ' t-00 1 - F(t) t-00 U(t) 

(see, e.g., de Haan and Ferreira, 2006, page 19). 

Such cdf's constitute a major subclass of the family of heavy-tailed 
distributions. Heavy tailed distributions have been applied in many 
different areas such as population size (Zipf (1949)), random graph 
(Reittu and Norros (2004)), internet traffic (Resnick (1997a)), hydrol- 
ogy (Katz et al. (2002)), finance (Danielsson and de Vries (1997)). 

Suppose that Xi, ...,X n are i.i.d. random variables with cdf F which 
has regularly varying tails with index 7 G (1/2, 1), then (1.1) is equiv- 
alent to the statement that F has infinite second moment. 



Key words and phrases. Heavy tails, index, high quantiles, mean, trimmed mean, 
reduced bias. 
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In this paper we interested by estimate of the mean for a heavy tailed 
distribution, that is 

r+oo 

fj, = E(X) = / xdF(x), 
Jo 

this may be rewrite in term of the quantile, as follow 

\x = / Q(l — s)ds. 
Jo 

where Q(s) := mi{x : F(x) > s}; < s < 1, denotes the quantile 

function of the df F. In this case, to obtain a consistent estimator of 

the mean for any 7 e (1/2, 1) , Peng (2001) partitioned the mean fi 

into 

rk/n pi 

fj, = / Q(i_ s )d s+ / Q(l - s )ds 

Jo Jk/n 

= « (1) + « (2) 
H'n ' f^n i 

where k = k(n) is an intermediate integer sequence satisfying the con- 
dition 

(1.2) k — * 00 and k/n — > as n — ► 00. 

Then //„ and /in are estimated separately. One obvious estimator for 
jjL n is the trimmed mean 



1 "• "■ 

(1-3) iui 1} = -^^, n , 

i=l 

which has been studied extensively (see Csorgo et al., 1986a,b, GriEn 

and Pruitt,1989). 

(2) 
The Extreme Value Theory is used to estimate /A , the mean for the 

right tail as follows: For a small value of p, we want to estimate the 

quantil Xi- P i such that F(xi~ P ) = 1 — P- More specifically, we want to 

estimate 

(1.4) Xi- P = U (1/p) , p = Pn - 0, np n -* K e [0, 1] , 

and we shall assume to be working in Hall's class of models (Hall 1982; 
Hall and Welsh, 1985), where there exist 7>0, p<0,C>0 and 
(3 7^ such that 

(1.5) U(t) = Ct 7 (l + A(t)/p + o(t p )), as t -+ 00. 
Since, from (1.4) and (1.5), we have 

Xi- P = U (1/p) ~ Cp^ 1 as p -+ 0. 

An obvious estimator of Xi- P * s Cp _7 , with C and 7 any consistent 
estimators of C and 7, respectively. 

Let Xi jn , ...,X n>n . denote the order statistics of Xi, ...,X n . Denoting Y 
a standard Pareto model, i.e., a model such that Fy(y) = 1 — 1/y, 
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y > 1, the use of the universal uniform transformation enables us to 

write X n _k,n = ^J(Yn-k,n)- Next, since Y n _k,n ~ (n/k) for intermediate 
k satisfiying (1.2), we get 

X n - k ,n ~ CY^_ Kn ~ C(n/ky, asn^oo. 

Consequently, an obvious estimator of C, proposed by Hall and Welsh 
(1985), is 

(1.6) C^(k)=X n ^ n (k/nf, 
then 

(1.7) Q^(k) = X n ^ n (k/npf. 

is the obvious quantile-estimator at the level p (Weissman, 1978). 

(2) 

Peng (2001) using this estimator for estimate /m , as follows 

(1-8) /#> = "*„-*,„: ' 



n ' 1 - In 

where 7 n is Hill estimator of 7, defind as follows 

1 k 

(1.9) 7 n = - ^2 \°g( X n,n-i+l) ~ log(X„ in _^ 

Finally, the result of Peng is as follows 

-k 



1 u 

(1.10) ij„-Vi,,„ + -i 



n—k,n . 



n ^ n l - 7„ 

The details on asymptotic distributions for the estimators fi can be 
found in Peng (2001). The Hill estimator of the index plays a central 
role in modelling the tail distribution. 

The consistency property of the Hill's estimator is given in Mason 
[1982]. The asymptotic normality of the Hill's estimator has been 
demonstrated by many authors, see for example, de Haan and Resnick 
[1980], Csorgo and Mason [1985], Hausler and Teugels [1985], and 
Beirlant and Teugels [1987]. The detailed explanation of mathemat- 
ical properties of the Hill's estimator can be found in Beirlant et al. 
[2004]. To be able to conduct reliable inference about tail properties 
of the heavy tailed distribution it is important to determine the op- 
timal threshold k. A number of methods for estimation of optimal k 
were proposed in the literature (Dekkers and de Haan [1993] , Hall and 
Welsh [1985], Beirlant et al. [1996a], Beirlant et al. [1996b], Draisma 
et al. [1999]). Most of the proposed methods are based on minimising 
the asymptotic mean squared error (AMSE) of the tail index estimate 
for a given sample size and corresponding fraction of extreme value 
statistics. 

As Peng's estimator /i n is based on Weissman's estimator Q~' (k) , 
so known to be largely biased. 
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Our main aim in these paper is essentially to present a new reduced 
bias estimators for the mean p, valid in a wide sub-class of Hall's class of 
models (Hall, 1982), and in the lines of both Matthys et al. (2004) and 
Gomes and Figueiredo (2006), i.e., based on a direct accommodation 
of the bias for high quantiles and an adequate reduced-bias estimation 
of the tail index , respectively. The key feature of these new estimators 
exists in the fact that the estimation of the second-order parameters 
in the respective bias-terms, described in Section 2 of this paper, is 
performed at a level k\ of a larger order than the level k at which 
the parameters 7 and x P are classically estimated. Doing this, we are 
able to guarantee a mean squared error smaller than of the classical 
estimators for all levels k and for mean estimation. In addition, the 
asymptotic behavior of the new estimator of the mean will be derived 
under appropriate higher order conditions, in Sections 3 and 4, respec- 
tively. The simulation study in Section 5 will enable us to obtain some 
of the features of these new estimators for finite samples, the proof of 
the main result are postponed to section 6. 

2. Second order conditions 

In order to derive the asymptotic non-degenerate behaviour of semi- 
parametric estimators of extreme events parameters, we need more 
than the first order condition in (1.3), a second order condition on F 
(see de Haan and Stadtmilller, 1996), assuming that there is a function 
A with constant sign near infinity, such that 

, N , U(tx)/V(t) - x^ ^x p -l „ 

(2.1) lim V >'\> = x 1 , for any x > 0, 

s^oo A(s) p 

where p < is the shape second order parameter. If p = 0, interpret 

xP/-'-l 



ph 



as log x. Or equivalently, 



In TJ(sx) — In U(s) — 7 In x x p — 1 „ 

2.2 lim y —^ — , v ; = , for any x > 0. 

A(s) p 



s^oo 



A typical condition for heavy-tailed models, which holds for the models 
in (1.5), with 

(2.3) A(t) = 7 /3t p with 7 > 0, p < and (3 ^ 0. 

Under the second order framework in (2.1) or in (2.2), and for interme- 
diate k, i.e., whenever (1.2) holds, we may guarantee the asymptotic 
normality of the Hill estimator 7 n , for an adequate k. Indeed, we may 
write (de Haan and Peng, 1998), 

^ d 7 A(n/k) , , .. 

(2.4) 7 „l 7+ ^ Zi + _iJJ (1 + 0p(1)) , 
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with Zk = yk (Si=i-^A ~~ l)> an d {-^j} i-i-d- standard exponential 
r.v.'s. 

Consequently, if we choose k such that yk~A(n/k) — > A 7^ 0, finite, 
as n — »■ 00, then vft (-y n — 7) is asymptotically normal, with variance 
equal to 7 2 and a non-null bias given by A/ (1 — p). Most of the times, 
this type of estimates exhibits a strong bias for moderate k and sample 
paths with very short stability regions around the target value. This 
has recently led researchers to consider the possibility of dealing with 
the bias term in an appropriate way, building new estimators, Jn(k) 
say, the so-called second order reduced-bias estimators discussed by 
Peng (1998), Beirlant et al. (1999), Feuerverger and Hall (1999), Gomes 
et al. (2000), among others. Then, for k intermediate, satisfying (1.2), 
such that (2.1) holds, and under the second order framework in (1.10), 

R 

k 



we may write, with Zj? an asymptotically standard normal r.v. 



(2.5) %(k) = 7 + ^Z* + o P (A(n/*)) , 

where <7r > and A is again the function in (1.10). Consequently, the 
sequence of r.v.'s, yki^y R (k) — 7) is asymptotically normal with vari- 
ance equal to (7<tr) and a null mean value even when ykA (n/k) — > 
A 7^ 0, finite, as n — ► 00, possibly at the expense of an asymptotic 
variance (70^) > 7 2 . Gomes and Figueiredo (2006) suggest the use, 
in (2.1), of reduced-bias tail index estimators, like the ones in Gomes 
and Martins (2001, 2002) and Gomes et al. (2004), all with a R > 1 in 
(2.5), being then able to reduce also the dominant component of the 
classical quantile estimator's asymptotic bias. 

More recently, Caeiro et al. (2005), Gomes et al. (2007a, 2007b) 
consider new classes of tail index estimators, for which (2.5) holds with 
gr — 1 at least for values k such that ykA (n/k) — ■> A, finite. These 

classes are dependent on f/3,p], an adequate consistent estimator of 

the vector of second order parameters (/3,p) in (2.3). The influence of 
these tail index estimators in quantile estimation has been studied by 
Gomes and Pestana (2007) and Beirlant et al. (2006). 

3. Reduced bias of index and extreme quantile 

The second order reduced-bias extreme value index estimator intro- 
duced in Caeiro et al. (2005), is given by 

(3-1) 7* (*) = l Rrp$ (k) = % ( 1 - y-^ ^ 

for adequate consistent estimators p and /3 of the second order para- 
meters p and (3 respectively. 
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Also recently, new estimators of C have been proposed (Caeiro, 
2006), where, instead of X n _k,n alone, a spacing X n _^u n — X n _k,n, 
< 6 < 1, is considered. More specifically, we may replace C in (1.6) 
by 

/oo\ n (l, n\ _ -X-n-[k8]:n — A n _ fe:n / k \ 

(3.2) G^ (k, 6) - ^— -^ ^- 

where < (9 < Lis a tuning parameter and 7^ = 7^ (k) is a second 
order reduced-bias extreme value index estimator. Similarly to the 
way developed by Caeiro et al. (2005) for the extreme value index 
estimation, Caeiro (2006) has worked out the main dominant compo- 
nent of the asymptotic bias of C^ R [k,8). With the parametrization 
A(t) = 7/3f, already given in (2.3), such a component is given by 
C x (1 - B e (7,p,/3)), where 

*w m 0-^ +p) -l 7 /3(n/A;) p 

(3.3) B e (7, P, 0) = g _ 7 _ 1 x WK < > . 

It is thus sensible to consider the semi-parametric C— estimator, 

(3.4) C~ 1R (M) = Xw ^::f^ :W (1)" x (l - Be (7*,?,?)) • 
We shall here consider, for = 1/2, the associated quantile estimator 

Qk (*) = Q P ^ rp $ (*) with 

(3.5) 

Q W>3 (*) = 2^31 (- J >< i 1 - *V2 (7* P, /3 

3.1. Estimators of the shape second order parameter p. We 

shall consider here particular members of the class of estimators of the 
second order parameter proposed by Fraga Alves et al. (2003). Such 
a class of estimators may be parameterized by a tuning real parameter 
r G K (Caeiro and Gomes, 2004). These -estimators depend on the 
statistics 

(MP(fc)) T -(M, ( l 2) (fe)/2) T/ " 



(3.6) T^\k) 



r-i ,,,,,,.,,, ,.W3 , ifr^O 

ln(M4 1) (fc)) T -|ln(M, ( l 2) (fe)/2) T/2 



(jv4 2) (fc)/2) T -(Mi 3) (fc)/e) T 



;, ifr = 



iln(M^ ) (fc)/2) T/2 -|ln(M^ 3) (A : )/6) T/3: 

converge towards 3(1 — p)/(3 — p), independently of the tuning para- 
meter, whenever the second order condition (10) holds and k is such 
that (1.2) holds and yk. A(n/k) — > oo, as n — > oo, where 

k 

(3.7) M«(fc) = ^ (log(X n , n _ i+1 ) - log(X n ,„_ fe+1 )) r . 

4 = 1 
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The p-estimators considered have the functional expression, 

(3.8) p[ T \k) = -min ( 0, 3 Z«^1^ 1 

Remark 3.1. The theoretical and simulated results in Fraga Alves 
et al. (2003), together with the use of these estimators in different 
reduced-bias statistics, has led us to advise in practice the estimation 
of p through the estimator in (3.8), computed at any value 

(3.9) h := [n l - £ ] ,e>0. 

The value k\ in (3.9) is not necessarily optimal but, for a large class of 
heavy-tailed models, it enables us to guarantee that, with'p = p^ T '(&), p— 
p = o p (1/lnn) , as n — > oo. The choice of the tuning parameter r = 
for the region p G [— 1, 0) and r = 1 for the region p G (— oo, —1) is a 
sensible one. 

3.2. Estimators of the scale second order parameter j3. For the 

estimation of j3 we shall here consider the estimator developed in Gomes 
and Martins (2002), with the functional expression, 



(3.10) B~(k) ■■ . . 

where 

with p = Pn (k) and 



Ui = i (logX n _ m , n - logX 



n—k,n) j 



4. Reduced bias of the mean and the main result 

bstituting 
tion, we obtain 



Substituting (X ~ ■% (k) for Q in expression of ph, , and after integra- 



-A-n— [k8]:n -^n—k:n I K 



6 lR - 1 \n 



x (l-B e (i R ,p,^j 



{il) _ ?) = ( ^ (x^-x^) x ^_ Bi/2 [%X0 



n 



Finally, our estimate of the mean is defined as follows 

(4-2) ' 

1 ^ — -v I k\ -X-n— \k/2Yn — -^-n—k-n ( ( ^ ^ ^ 

^ = n g X - + U ) &« - 1) (1 - 7 fl ) X i 1 " * 1/2 I 7 *' * P 
Our main result in the following theorem. 
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Theorem 4.1. Under the second order framework in (1.10) with A(t) = 
jl3t p , for an intermediate k, i.e., k such that (1.2) holds, whenever 
ln(np)/vk — ► 0, and Vk,A(n/k) — > A, as n — ► oo, /or p and /3 defined 
by (3.8), (3.10), such thatp-p = o P (l/lnn). T/ien 



(k/n)X n _ k 



(p n — //)—► TV (0, a 2 ) , as n — »■ oo, 



where 



-2 2 7 . 2 
a = 77^ TV + 7 



2 7 ln2 



(27-1) L(l-7) 2 (1-7) (2^-1) J 

1 2^ In 2 



+ 2 7 



7) 2 (1 " 7) (* 



5. Simulation study 



We use the R statistical software (Ihaka and Gentleman, 1996) to 
apply the above result to the most usual distribution of Hall class, 
namely, the Frechet model: F{x) = exp (— rr -1 / 7 ) for x > 0, (for which 
| < 7 < 1, C — 1, p — -1 and /3 = 1/2). We simulate 1000 random 
samples of size n, where n =500; 1000 and 2000. We assume that the 
second order parameters p and (3 are unknown and they are estimated 
through (3.8) and (3.10), respectively, both computed at the level k\ = 
[n ' 995 ], i.e., we have chosen e = 0.005 in (3.9). 

In the first part, and by using the results of theorem (4.1), we fix 
C € ]0, 1[ and q^/ 2 is the (1 — |)— quantile of the standard normal 
distribution A/"(0, 1). The bounds of confidences of the mean are given 
by 

(n/k)_ 
p = p n ± q^ /2 —7=-o-. 

Now, we compare our estimator of the mean with the estimator of Peng 
(2001), in terms of bias and rmse, 

6. Proofs 
Let us consider that 

~Wn~ t 1 = An,l + A n>2 , 



where 



and 



.. n— k „i 

A n ,i = Tl,, B - Q(l-s)ds 

n j-[ Jk/n 



A„,^ ["*) ^gg^Sg x^-^ (w.g))-f °Qd - .)*■ 
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TABLE 1. Analog between the new estimator of the mean 
and the estimator of Peng (2001) for two tail index. 



7 


2/3 


3/4 


n 


500 


1000 


2000 


500 


1000 


2000 


n 


2.87635 


3.6380 


Pn 

bias 
RMSE 


2.9807 
0.1043 
0.0350 


2.9441 
0.0677 
0.00784 


2.8783 
0.0020 
0.0055 


3.7638 
0.1258 
0.5068 


3.6423 
0.0429 
0.3459 


3.6241 
-0.0139 
0.0262 


bias 
RMSE 


3.2888 
0.4125 
0.1762 


2.6797 
-0.2551 
0.0795 


2.6911 
-0.1966 
0.0652 


3.9629 
0.3248 
0.7612 


4.02021 
0.3821 
0.6635 


3.3675 
-0.2704 
0.1301 



A n 2 may be rewriten as follow: 



A 



■ii.2 



K \ -A-n—[k/2]:n -^n—k:n 

n) (1-7*) 

x (l-B 1/2 (^ R ,p,p 

k\ X n _[ k /2] :n — X n _ k 



( 2 7* - 1) (27 - 1) 



+ 



n 



(2t - 1; 



x [l-B 1/2 [j R ,p,/3 



1.(1-7*) (1-7) J 



k\ X n _ 



X n -l 



n—[k/2]:n -A-n—k:n 

nj (27 - 1) (1 - 7 ) 
x ( B 1/2 (j R , p, /?) - Bi /2 (7, P, P) 

™ \ -A-n— [k/2]:n -^-n—k-.n 



+ 



nj (27-l)(l- 7 ) 



rk/n 

x(l-B 1/2 ( 7 ,p,/3))- / Q(l- s)ds. 

Jo 



Now, with the use of the delta method enables us to write 



2^ In 2 



( 2 7 fl _ l) (27 - 1) ( 2 7fl - 1) : 



(7fl - 7) 



1 p 1 n \ 



(1-7*) (1-7) (1-7) 



207 



10 

and 



A. RASSOUL 



#1/2 \1r,P,P 
Bl/2(l, P,P) 



1+1 



72 7 ( 2 P - 1) \ tf R - 7) 



+ 



0-0 



/? 



(27- 1)(2t+p- l)y 7 



+ (p - p) In (n/fc) . 



As: 1r- 1 = °p (!)> (P - P) ln ( n A) = °p (1) and - /3 = o p (1), we 
have 

Bi/ 2 (7fl,P,3)=B 1/2 ( 7 ,/9,/3)(l + o p (l)). 
Either, with Q(l - s) = Cs' 1 , then 

-k/n 






-A-n— [k/2]:n -^-n—k-.n 



c — ( k / n y 

1-7 



(27 - 1) 



(k/n) 



1-7 



;i - b (7, p, /?)) 



where 



Then, 



Q — Xn-[k/2]:n -X-n~k:n ffcl \1 



k\ X n _[ k /2] :n — X n _ k:n 



nj (27-1) (l- 7 ) 
as n — > 00. We obtain that 



(27 - 1) 



x(l-B 1/2 ( 7 ,p,/3))- 



k/n 



Q (1 - s) ds = Op(l) 



v 7 *? 



(k/n)X n _ k ., 



-A 



»,2 



1 



2^1n2 



.(1-7) 2 (1- 7) (27-1) 
With the result of Gomes et al. (2007), we have 

//e (7 ^_ 7 )^A/-(0,7 2 ), 



^k(lR-l)+o p (l). 



as n — ► 00. 



Let us now consider the asymptotic distribution of A„i. It is shown in 
Csorgo and Mason (1985) that, there exists a sequence of independent 
Brownian bridges {B n (s); < s < l} n>1 such that, for all large n 



Vk 



(k/n)X n _ k: , 



-A 



J*B n (l-s)dQ(l-s) 



n.2 



I k/n 



Finally: 



Vk 



(k/n)X n - k: . 



(k/n)Q (1 - k/n) 

(K ~ P) = A n (7, P) 



+ o v (1) . 
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where 



A n (7) 



2^1n2 



(1 - 7) 2 (1 " 7) (2f - 1) 



(1r ~ 7) 



(k/n)Q (1 - k/n) 



+ o p (l). 



It is clear that A n (7) is a Gaussian random variable with mean zero 
and variance 

E (A n ( 7 )) 2 = E (Wx) 2 + £ (W 2 ) 2 + 2E (W1W2) . 

An elementary calculation gives, as n — ► 00 

1 2 7 ln2 



^(W!) 2 = 7 2 



:i-7)' 



£(W 2 



27 



(27 
2£ (W1W2) = 2 7 



+ o(l] 



7) (2i 



2^1n2 



+ o(l) 



+ o(l) 



L(l-7) 2 (1-7) (2^-1) J 

Summing up the right-hand sides of the above three limits, we obtain 
a 2 , whose expression in terms of the parameter is given in Theorem 

(4-1). 

This completes the proof of Theorem (4.1). 
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ON THE HYERS-ULAM STABILITY OF NON-CONSTANT 
VALUED LINEAR DIFFERENTIAL EQUATION xy' = -Xy 

HAMID VAEZI AND HABIB SHAKOORY 

Abstract. Wc consider a diffcrcntiable map y from an open interval to a real 
Banach space of all bounded continuous real-valued functions on a topological 
space. We will investigate the Hyers-Ulam stability of the following linear 
differential equations of first order with non-constant values: 

xy' = -\y, 

where A is a positive real number and 

y e C{I) = C(a, b), - oo < a < b < +oo, x e (0, oo). 



1. Introduction 

C. Alsina and R. Ger [1] remarked that the differential equation y' = y has the 
Hyers-Ulam stability. More explicitly, they proved that if a differentiable function 
y : I — > R satisfies \y'(t) — y(t)\ < s for all t € I, Then there exists a differentiable 
function g : I — > R satisfying g'(t) — g(t) for any t € I such that \y{t)—g(t)\ < 3e for 
every t € I. T. Miura , S.-E. Takahashi and H. Choda, [3] consider a differentiable 
map y from an open interval to a real Banach space of all bounded continuous real- 
valued functions on a topological space. They show that y can be approximated by 
the solution to the differential equation x'(t) — Xx(t) if \\y'(t) — At/(i)|| 00 < e holds. 
In [4] , this result was generalized to the case of the complex Banach space valued 
differential equation y' — Xy. Y. Li [2] prove stability in the sense of Hyers-Ulam of 
differential equation of second order y" = A y. In this article, we say that equation 
xy' = —Xy has the Hyers-Ulam stability if there exists a constant K > with the 
following property: for every x, e > 0, y € C{I) if 

\xy' -{-Xy)\ <e, 

then there exists some z € C(I) satisfying 

xz + Xz = 

such that 

\y{x) — z(x)\ < ke. 
We call such k as Hyers-Ulam stability constant for equation 

xy' = -Xy. 



Key words and phrases. Hyers-Ulam Stability, differential equation, approximation. 
2010 AMS Math. Subject Classification. 34K20; 26D10. 
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2. Definitions 



Definition 2.1. Let B be a Banach space, y a map from, I {denotes an open interval 
of the real number field R) into B. We say that y is differ entiable, if for every t € I 
there exists an y' G B so that: 

lim\\(y(t + S )-y(t))/( S )-y'(t)\\ B =0, 

where \\-\\b denotes the norm on B. Let y be a differ entiable function on I in to 
R. Alsina and Ger [1] gave all the solutions to the inequality \y'(t) — y(t)\ < e 
for every ( € /. Then they showed that each solution to the inequality above was 
approximated by a solution to the differential equation z' (t) = z(t). 

In accordance with [1], we define the Hyers-Ulam stability of Banach space valued 
differcntiable map. 

Definition 2.2. Let B be a Banach space, y a differ entiable map on L (denotes an 
open interval of the real number field R) into B so that for each t G / : 

\\y'(t)-Xy(t)\\ B <e. 

We say that Hyers- Ulam stability holds for y if there exist a k > and a differen- 
tiable map z on I into B such that: 

z'(t) - Xz{t) = 

and 

\\y(t) - z(t)\\ B < ke 
hold for every t £ /. 

3. Main results 

Now, the main result of this work is given in the following theorem. 

Theorem 3.1. // a continuously differ entiable function y : L — > R satisfies the 
differential inequality 

\xy' + Xy\ < e 

for all t G / and for some e > 0, then there exists a solution v : I — > R of the 
equation: 

xv + Xv = 
such that 

\y(x) -v(x)\ < ke, 
where k > is a constant. 

Proof. Let e > and y : L — > R be a continuously differentiable function such that 

\xy' + Xy\ < e. 

We will show that there exists a constant k independent of e and v such that 
\y — v\ < ke for some v G C(L) satisfying : 

xv + Xv = 0. 

If we set 

x — e u , x G (0,oo), 

xy' = x.dy/dx — x.dy/du.du/dx — x.l/x.dy/du — dy/du = Dy 
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according the above: 

x v' = d y/ du = yu> 

then 

|y'(w) + Ay(w)| < e, 
cquivalently, j/ satisfies 

— e < y'(u) + Ay(u) < e. 
Multiplying the formula by the function e A ", we obtain 

-ee Xu < y'(u)e Xu + Xy(u)e Xu < ee Xu 

For the case < A < 1, there exists M > such that MX > 1, so without loss of 
generality, we may assume that A > 1, thus 

(3.1) -\ee Xu < y'(u)e Xu + \y(u)e Xu < Xee Xu . 

For some fixed c G (a, b) with y(c) < oo and any u G (c, 6), integrating (3.1) from c 
to u, we get 

/>U f-U f-U fU 

-Ae / e Xu < / y'(ii)e Au + A / y(u)e Aw < \e / e Au , 

^c ^c J c J c 

and so 

_ ee M«- C ) < y ( u ) e *" _ 2/(c)e Ac < +ee x{ - u - c l 

Summing parts of equation with +ee Xc : 

-ee x ^ < y(u)e Xu - y{c)e Xc + ee Xc < +ee x ^ 

Multiplying the formula by the function e~ Xu , we get 

-e < y(u) - (y(c) - e)e Xc e - Xu < e, 

so 

-e < y(u) - (y(c) - e)e Xc e - Xu < e. 
Then, 

-e < i/(«) - (y(c) - e)e x{c - u) < e. 
Let z(u) — (y(c) — e)e xlyC ~ u \ Then z(u) satisfies 

z'{u) + \z(u) = 

and 

|y(u) -z(u)| < e 
For any a £ (m, c), the proof is very similar to the above, so we omit it. We have 

—e < y(u) — z(u) < e. 

where k = 1 if we change x = e u and x G (c, 6). By an argument similar to the 
above for some fixed c G (a, o), we can show that there exists 

»w-«»)-*^-*-»/^- A, -" ) " s 

such that 

\y(x) -v(x)\ < e, 
which completes the proof. □ 
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THE APPROXIMATE SOLUTION OF MULTI-HIGHER ORDER 

LINEAR VOLTERRA INTEGRO-FRACTIONAL DIFFERENTIAL 

EQUATIONS WITH VARIABLE COEFFICIENTS IN TERMS OF 

ORTHOGONAL POLYNOMIALS 

SHAZAD SHAWKI AHMED AND SHOKHAN AHMED HAMA SALIH 



Abstract. The main purpose of this paper is to present an approximation 
method for multi-higher order of linear Volterra Intcgro- Fractional Differential 
Equations (m-h LVIFDEs) with variable coefficients in the most general form 
under the conditions. The method is based on the orthogonal polynomials 
(Chebyshev and Lcgcndre) via least square technique. This method trans- 
forms the fractional equation and the given conditions into matrix equations, 
which correspond to a system of linear algebraic equations with unknown coef- 
ficients and apply Gaussian elimination method to determine the approximate 
orthogonal coefficients. The proposed method including two new algorithms 
for solving our problem, for each algorithm, a computer program was written. 
Finally, numerical examples are presented to illustrate the effectiveness and 
accuracy of the method and the results arc discussed. 



1. Introduction 

In this work, we will consider the multi-higher order of linear Volterra Integro- 
Fractional Differential Equations (m-h LVIFDEs) with variable coefficients in the 
form: 

{n-l 
c a D^y(x) + E Pi{x) c a D^y(x) + P n {x)y{x) 
- fix) + E \fKe(x,t)°D* m -'y(t)dt, xe[a,b]=I 
£=0 a 

with the initial conditions: 

(1-2) [y {k \x)] { x = a)=y k - 

fc = 0, 1,. ..,/i — l;/i = max{rrii, m| for all i and £}, 
where y{x) is the unknown function, which is the solution of equation (1.1). The 
functions K e : S x R -> R, (with S = {(x, t) : a < t < x < &}); t = 0, 1, 2, . . . , m and 
f,Pi : I — >• R are all continuous functions .Where ai,/3j G M. + ,rrii — 1 < on < rrii 
and m\ — 1 < fit < m|, rtii = \ai\ and m\ = \fii\ for all i = (1 : n) and I = (1 : m) 
with property that:a„ > a n _i > . . . > a± > a$ = and fi m > fi m -\ > ■ ■ ■ > fii > 
fio = 0, and A is a scalar parameter. 

Actually most linear Volterra intcgro-fractional differential equations of multi- 
higher order with variable coefficients do not have exact analytic solutions, therefore 



Key words and phrases. Integro-Fractional Differential Equations, Caputo Fractional Deriva- 
tive, least-square technique, Orthogonal (Chebyshev, Legendrc) polynomial. 
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approximation and numerical techniques must be used. The area of orthogonal 
polynomials is a very active research area in mathematics as well as in applications 
in mathematical physics, engineering, computer science and others [7, 8]. 

AL-Rawi [3] and Qassim [14] used Laguerre polynomial to approximate the so- 
lution of the first kind integral equations depending on the principle of the least- 
squares data fitting. On the other hand Ameen [14] and Al-Ani [1], Kalwi [2] applied 
this techniques to treat numerically FIE's of second kind using Laguerre, Hermit 
and Legendre and Chcbyshcv polynomials, respectively. In this paper, we extend 
this method to further deal with our consider problem (1.1) with initial conditions 
(1.2). 

2. Preliminaries and Notations 

2.1. Basic Definitions. 

Definition 2.1. [16], A real valued function y defined on [a, b] be in the space 
C 7 [a,6],7 £ M., if there exists a real number p > 7, such that y(x) = (x — a) p y*(x), 
where y* £ C[a, b], and it is said to be in the space C"[a, b] iffy^ n ' £ C 1 [a 1 b],n £ No- 

Definition 2.2. [4, 5], Let y £ C\[a, 6], 7 > —1 and a £ R + . Then the Riemann- 
Liouville fractional integral operator a J x of order a of a function y, is defined as: 

X 

aJ^y{x) = ^y J(x - t) a - l y(t)dt, a > 

a 

aJ°y( x ) = !y( x ) = vi x )i a = ° 

Definition 2.3. [4, 5], Let a > and m = [a] .The Riemann-Liouville fractional 
derivative operator ^D x , of order a and y £ C^-^a, b), is defined as: 

*D«y(x) = DZ\J™~ a y{x) 

If a = m,m £ No, and y £ C m [a, b] we have 

*D x y(x)=y(x), *D™y(x) = y™(x) 

Definition 2.4. [6, 10], The Caputo fractional derivative operator ^D x °f order 
a £ K + of a function y £ CH\ [a, b] and m — 1 < a < m, m £ N is defined as: 

C a D a xV {x) = a JT- a D™y(x) 

Thus for a — m, m £ No, and y £ C m [a 1 b], we have for all a < x < b 

C a Dly{x)=y{x), C a D™y{x) = D™y(x) 

Note That: [4, 5, 6, 10] 

i: For a > and f3 > 0,then a J£(x - a)' 3 - 1 = r ^ a) {x - a)^ 01 ' 1 . 
ii: For a > , j3 > and y(x) £ C 7 [a, b], 7 > — 1, then : 

a J x a J x y(x) = a J x a j x y(x) = a J x y(x) 

iii: a D x A = ^ r(i-a)° and a D x A = °; A is an y constant ; (a > 0,a £ N). 
iv: c a D« y {x) = D™ a *J?- a y(x) * a J™-<*D™y(x) = ^D^y(x) ;m=\a] 
v: Assume y £ (7™! [a, b];a > 0, a ^ N and m = \a] Then ^ D x y(x) is 
continuous on [a,b], and \^D"y(x)] _ = 0. 
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vi: Let a > 0,m — \a\ and y <G C m [a, b], then, the relation between the 
Caputo derivative and the R-L integral are formed: 

m—l i k \, % 

c a D« a J«y{x) = y(x) a<x<b; a J« C a D«y{x) = y(x) ]T V -^- (x a) k 

k=0 

vii: %D%y(x) = ^D" [y(x) - T m _i [y; a]];(m - 1 < a < m) and T m _i [y; a] de- 
notes the Taylor polynomial of degree m—l for the function y , centered 
at a. 

viii: Let a > 0; m — [a] and for y(x) = (x — a)* 3 for some j3 > O.Then: 

f »/ ^G{0,1,2 m-1} 

[ or /3 ^ N anrf /3 > m - 1 

2.2. Orthogonal Polynomials. Orthogonal polynomials are classes of polynomi- 
als ifii; i = 0, 1, 2, . . . defined over a range [a, b] that obey an orthogonality relation 
J w(x)(pi(x)ifj(x)dx — SijCj, where w(x) is a weighting function and dij is the 
kronecker delta (equal to 1 if i = j and to otherwise) . If Cj = 1 then the polyno- 
mials are not only orthogonal, but orthonormal. Two of the most common set of 
orthogonal polynomials are Chcbyshcv polynomials Tk(x) and Legendre polynomi- 
als P k (x), [11, 12, 14]. 

2.2.1. Chebyshev Polynomials. The Chebyshev polynomials are orthogonal over 
[—1,1] with respect to the weight function w(x) = 1/vl-i 2 . The Chebyshev 
polynomials of orderfc > 0, T k (x), for — 1 < x < 1, given by the simple calculation 
forms: 

UJ 

(2.1) T k {x) = lt, { i^( k ~ r ) (2 " )fe ~ 2r ' T ° {X) = 1 ' k ^ 1 

where [*J is the floor function and T k (x) has k— distinct in [— 1, 1], the roots {xi} i=1 
have the expression: 

(2.2) a?i = coa( 7r( *" l ; 1/2) ); i = 0, 1,2, . . . ,k - 1 

k 

Also, the n— derivative of Chebyshev polynomials T k [x) is formulated as: 

mm i s { fcyL^-l t- 1 )" (k-r- 1)! r,n(r, \k-2r-n Af u -> 

d n T k (x) j 2 2^r=0 r! (fc-2r-n)! / ^^^ Z / K>n 

da-n = ^ 2 fe - 1 fc! i/ fc = n 

[ if k < n 

2.2.2. Legendre Polynomials. An important another set of the polynomial approx- 
imation over [— 1, 1] is Legendre polynomials, a simple calculation formula for it of 
higher order k > 0, is defined in terms of the sums: 



UJ 
(2.3) P k (x) = - lt J2(- 1 Y 



2 k 

r=0 



X k -2r 
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Also, the n— derivative of Legendre polynomials Pk(x) is formulated as: 

f 1 V^L^J (-1)" (2fc-2r)! h-lr-n Af u ■> „ 
d n P k (x) 2<= 2^r=0 r\{k-r)\ (k-2r-n)\ X % J K > n 

dx n = \ (2k)\/2 k kl if k = n 

[ if k <n 

Remark 2.1. [12], Sometime it is necessary to take a problem stated on an interval 
[a,b] and reformulates the problem on the interval [—1,1] where the solution is 
known. If the approximation /n{x) to f(x) is to be obtained on the interval [a,b], 
then we change the variable so that the problem is reformulated on [— 1, 1] Thus 
x = (^)t+(^), £ = 2(fff)-l where a < x < b; -1 < t < 1. 

2.3. Gaussian Quadrature Formulas. Gaussian quadrature formula can be de- 
veloped based upon the orthogonality property. All Gaussian quadrature share 
the following formula: J w(x)f(x)dx = X)fe=o ^kf{xk) where Xk,k — (0 : N) are 
called nodes which , here, associated with zeros of orthogonal polynomials are the 
integration points and A& , k = (0 : N) are called weights of the quadrature for- 
mula related to the orthogonal polynomials, [9, 12]. If the weights function is 
w(x) = l/yj(x — a)(b — x) on interval [a, b]. Then, the Open Gauss- Chebyshev 
quadrature rule has the form: 

f b i N-l 

where x k = ( b -^)z k + (*±*) and z k = co S (2MI§ ) 

and the Closed Gauss-Chebyshev quadrature rule has the form: 

(2.5) / =f(x)dx=?-Y"f(x k ) 
J a ^{x-a)(b-x) n N{-^ n 

where x k = ( b -^)z k + (*±2) and z k = cos{^) 

where the double prime (") on the summation sign implies that the first and end 
terms are halved. If the weight function is w(x) = 1 on the interval [a, b]. Then, the 
general Gauss-Legender quadrature rule has the form: 

,h , _ N-l 

(2.6) / f(x)dx=— -$>*/(**) 
Ja z fc=0 

where x k = (^^)zfc + (^7p) and z k are the k— th zeros of Pn(x). The coefficients 
Afe's can be calculated using the following formula : 

2 



(2.7) X k 



{l-zl){P' N {zk)Y 



2.4. Lemmas. Before starting the solution of general form of m-h LVIFDEs by 
least-square orthogonal method, the following two basic lemmas are needed: 

Lemma 2.5. [13], The Caputo fractional derivative for order m — l<a< m,m = 
\a~\, of shifed Chebyshev polynomial of degree k > 1, T^(x) — T k [2(j^) — 1] on 
interval [a, b] can be formulated: 
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C a D«TZ{x) 
(2-8) 



Lfc/2j 



/ \ 7~ft \ I \ 

(5^) (x-ar~ a E (-iy^ ) 2 k -^ M (x;k,r, m ) 



r=0 



where M(x; k, r, to) 




T(m-a+l) 
k—2r—m 



E 
£=0 



(-i)' 



r(€+m-a+l)r(/c-2r-m-^+l) 









m > k — 


2r 








to = k — 


2r 




/ \ " 


£ 






[2 


x — a \ 
yb-a] _ 




to < k — 


2r 



Lemma 2.6. [15], The Caputo fractional derivative for order m — l<a< m,m = 
[a], of shifed Chebyshev polynomial of degree k > 1, P^{x) = Pfc[2(f5f) — 1]> on 
interval [a, b] can be formulated: 



(2.9) 



£JW*(*) 



/ \ m L K / Z J 

f(s^) (*-°r-° E (-ir r^Ki) ^^." 1 ) 



where M(x;k,r,m) 




_ ^ r(m-a+l) 
fc— 2r— m 

E r(£+m-a+l)r(fc-2r-m-£+l) 



(-1) 



f+fc-2r 



( a: — a \ 
b-a) 



to > k — 2r 
m = k — 2r 

to < k — 2r 



3. Solution Technique 

In this section, a new technique for solving multi-higher linear VIFDEs with vari- 
able coefficients applying least-square data fitting with use orthogonal (Chebyshev 
and Legendre) polynomials has been presented. The method is to approximate the 
solution y(x) of equation (1.1) by i/n(x)'- 

N 

(3.1) y(x) =y N (x) = ^C r ip r (x) 

r=0 

The coordinate functions (f r (x) are usually chosen as orthogonal polynomials and 
C r 's arc undetermined constant coefficients for all r = 0, 1, 2, ... , N. Substituting 
2/Ar(x)in equation (1.1) , we obtain : 

N ( n-1 

E C r \ ^D^ip r (x) + £ P 4 (x)^LC"-> r (x) + P n {x)ip r {x) 
(o o\ r— I i— 1 



^Cnft 



/(&) + A £ C r / £ iO(af, t)^D P r- l Vr (t)dt + R N (x; C) 

r-0 a e=0 

n-1 

Define: ^ r {x) = c a D«"<p r (x) + £ Pi(x)° D? 1 -' (p r (x) + P„(x)ip r (x) 



,. m 

-A /£#,(*, t)?Df~-Vr(t)dt 
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Now, the equation (3.2) can be written as: 



JV 



(3.3) 



RN(X]C) = J2 C rM*)-f(*) 



where Rn(x; C) is the error function which is also depending on x and the constant 
coefficients C, {= (Co, C\, . . . , Cjv)}- The main points here, is how to find the 
coefficients C r ; r = 0, 1, 2, . . . , N of hn(x) in equation (3.1) such that Rn(x; C) is 
minimized. The general least squares techniques insist on minimizing the norm of 
the error functions by introducing a weight function w(x) on interval [a, b], we now 
wish to minimize 1(C): 

b 
1(C) = f w(x)\R N (x;C)\ 2 dx 



The necessary condition for 1(C) to be minimum, are 

b 

(3.4) ^M = 2 fw(x)R N (x;C) dRN d ^ C) dx = 0, a = 0, 1, 



.iV 



Putting equation(3.2) in to equation (3.4), we obtain: 



w(x)tp s (x) 



N 



^2C r ^ r (x) ~ f(x) 



dx = 



After some simple manipulation, the following linear system is included: 

JV 

(3.5) ^C r a rs = b s , 8 = 0,1,2,...,JV 



r=0 



where 



(3.6) 



a rs — j w(x)ip s (x)ip r (x)dx 

a 
b 

b s = f w(x)ip s (x)f(x)dx 



and Mx) =a D^ip r (x) + £ Pi(x)° DT^ y r (x) + P n (x)ip r (x) 



,. m 

A J2 K ^ x ^ D t m - e Mt)dt 



Rewrite equation (3.5) in matrix form as: 
(3.7) AC = B 

where 



A = 



aoo 


ooi • • 


&0N 


OlO 


an • • 


aijv 


awo 


ajvi • • 


&NN 





' Co ' 




' b Q ' 


,c = 


Cx 


,B = 


h 




Cn 




b N 
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In this technique the initial conditions of equation (1.1) are added as new rows in 
the system (3.7), these rows can be formed as: 



N 
J2Cr[4 k \x) 



r=0 

In matrix form, this gives: 

(3.8) 

where 



Vk, 



$c = r 



o,i, 



,M~ 1 



<E> = 



ipo(a) ipi(a) 

<p' (a) tp'iia) 






,C = 



Obtaining a new matrix by adding (3.8) to (3.7), yields 
(3.9) DC = E 



Co 



c 



N 



,Y = 



2/0 

2/1 



where 



D = 



A 



$ 



and 



E = 



D 



Y 



(JV+M+l)x(AT+l) L ' J (JV+^+l)x(l) 

To determine the constant coefficients C r 's in equation (3. 9), store the matrix D 
and compute D T D and D T E then use any numerical evaluation procedure to solve: 
[D T D]C — [D T E]. Then substitute the values C r 's in equation (3.1) ,thc approxi- 
mate solution is obtained for multi-higher linear VIFDEs (1.1). 

3.1. Using Chebyshev Polynomials. In this part, we can take instead of trial 



function ip r (x) the shifted Chrbyshev polynomial, T* (x) = T r 



2 P 

1 b—a 



l 



, to ap- 



proximate the solution y{x) of multi-higher order linear VIFDE (1.1-1.2) by formed: 



N 



vn{x) = y^c r r r 



r=0 



b — a 



a < x < b 



Substituting yw(x) in to equation (1.1), and applying the same steps described in 
(2.3), we conclude the system in (3.9), with: 

I: using Open Gauss Chebyshev formula (2.4) ,we have: 



(3.10) 



Af-l 

a rs = JJ E A(x 3 )lps(Xj) 
3=0 
M-l 

b s = Jf J2 i>s(Xj)f(Xj) 
3=0 



where M is the number of Chebyshev zeros we can take it and 

(3.11) { x . = (*=-)*.+ (*±«) ; ,. =cos( 2i±l f) 
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II: using Closed Gauss Chebyshev formula (2.5) ,we have: 



(3.12) 



M 
a rs = |f E "^r{Xj)^s{Xj) 

M 

bs = ^ E "^{Xj)f{xj) 

3=0 



where M is the number of Chebyshev extrema zeros we can take it and 

(3.13) { Xj = (^) Zj + (!*>) ; Zi=coa(%) 

To evaluate ip r (x) at x = £j for all r and j , for open and closed Gauss-Chebyshcv 
formulas, that is: 



(3.14) 



[A(x)] x=Xj = |f £>«» + g P % {x) c a D a — + P n (x)j T r [2 (f=f ) 



am 

-\f^K e (x,t)CD^-<T r 
a e=o 



2(H) -' 



We apply the following stages to evaluate the equation (3.14): First, using lemma 
(2.5) to evaluate the fractional differentiation of shifted Chebyshev polynomials at 
all x = Xj and all different r > 1. second, the integral part of equation (3.14) can 
be extended in to two integral terms I\ and I2 as: 



iCn^-" 



x m 

IEK e (x,t^D^T r 
a e=o 

x f . m-1 

Y, K ^ t ) C a D t m ' e Tr 



o— a 



eft 



£ — a 
b — a 



dt- 



"m\^t £,M r 



t — a 
b — a 



(11 



h 



For 7i: Apply lemma (2.5) for integrand fractional differentiation shifted Cheby- 
shev polynomials for different k > 1 , we obtain: 



(3.15) 



h = 



2 \ 6-0 



. ml \r/1\ 



p=0 



r(p+i)' 



* / K e {x,t)(t - a) m 't-P™- e M(t;r,p,m* e )dt 

a 

where M(t;r,p, m|) 







r(m*- / 3 m _ £ + l) 
r— 2p— m^ 

E r( 9 +m*-,3 m _ € + l)r(r-2p-m|- 9 +l) 

g=0 



(-1) 9 



2(!=« 

0— a 



m| > r — 2p 
m| = r — 2p 

ra\ < r — 2p 



For ^ Using the following transform to interchange the bounded points of integral: 

' t — a s 



z 



b- 



1: 



f 



6 — a . 6 + a 
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Thus 



(3.16) 



h = J K m (x,t)T r 

a 

2(f5f)-i 



2(H)-' 



dt 



b-o 
2 



/ K m (x,^t + iF)T r (£)d£ 



Finally, apply Romberg integrations rule to evaluate all integrals in equations (3.15) 
and (3.16) at each points x — Xj for all j. 

The Algorithm [ACP]: The approximate solution of multi- higher order linear 
VIFDEs with variable coefficients (1.1) with the use of Chebyshev polynomials in 
least-square orthogonal method can be summarized by the following stages: 

Step Is Evaluate ip r (xj), using equation (3.14) for all r = 0, 1, 2, . . . , TV and 
Xj's are defined by equations: (3.11) for open formed and (3.13) for closed 
formed. 

Step 2: From equations (3.10) and (3.12), compute a rs and b s for all r,s — 
0, 1, 2, . . . , N for open and closed Gauss-Chebyshev formula. 

Step 3: Construct the matrices D and E which represented in system (3.9). 

Step 4: For constant coefficients C r (r = (0 : N)) apply any numerical meth- 
ods for system which obtained in step 3 after multiply both sides by D T . 

Step 5: To obtain approximate solution un(x) of y(x), substituting C r 's in 
equation (3.1) where ip r = T* . 

3.2. Using Legendre Polynomials. In this part, we can take instead of trial 



function ip r (x) the shifted Legendre polynomial, P* (x) = P r 
proximate the solution y(x) of equation (1.1) by formed : 

N 



2(f5f)-J 



,to ap- 



!JN 



[x j — y L/fif 



, x — a , 
21 t ] -1 

b — a 



<x<b 



By applying the same stages as in Chebyshev polynomials technique and using 
Gauss-Legcndre formula (2.6), we get: 

{M-l 
a rs = ^ J2 ^jAix^ipsixj) 
bs = b ~T E \^s{x 3 )f{x 3 ) 
3=0 

where, M is the number of Legendre zeros we can take it and 

(3.18) 



b — a\ i b + a 

Zj + 



Here z 3 - are the M-th zeros of Pm(x) and A^ are defined as in equation (2.7), that 



A,= 



j=0,l,2,...,M-l 



{l-z1)[P' M {z 3 )Y 
To evaluate tp r (x) at x — Xj for all r and j , from Gauss-Legendre formula: 



bPr( X )] x = 



(3.19) 



x m 

-\JJ2K e (x,t)CD^- e P r 
a e=o 



2(f5f)-l 



1 b—a 



(ii 
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Apply same stage as to solve equation (3.14) with using lemma (2.6) we can obtain 
the values of equation (3.19). 

The Algorithm [ALP]: The approximate solution of multi-higher order linear 
VIFDEs with variable coefficients (1.1) with the use of Legendre polynomials in 
least-square orthogonal method can be summarized by the following stages: 

Step 1: Evaluate ip r (xj), using equation (3.19) for all r = 0, 1, 2, . . . , N and 

Xj's arc defined by equations: (3.18). 
Step 2: From equations (3.17) compute a rs and b s for all r, s = 0, 1, 2, . . . , N 

for Gauss-Legendrc formula. 
Step 3: Construct the matrices D and E which represented in system (3.9). 
Step 4: For constant coefficients C r (r = (0 : N)) apply any numerical meth- 
ods for system which obtained in step 3 after multiply both sides by D T . 
Step 5: To obtain approximate solution %jn(x) of y(x), substituting C r 's in 
equation (3.1) where <p r = P* . 

4. Numerical Examples 

Here, three numerical results are presented for multi-higher linear VIFDEs with 
variable coefficients. Their results are obtained by applying the algorithms (ACP 
and ALP), respectively. 

Example 4.1. Consider a linear VIFDE for fractional order lies in (0, 1) on closed 
interval [0, 1] : 

c D° x 6 y(x) + (x- l)y(x) = f(x) 

X 

+ J [(x 2 + tfD°- 7 y(t) + (1 - xt) C D° t A y(t) - sin(x - t)y(t)] dt 
o 
where 

f{x) = 2cosx + x 3 -2+ j^y* 1 - 4 - ^^y [33.x + 23]x 3 ' 3 

2 -[18-13a; 2 ]x 2 - 6 



r(4. 

together with the initial condition : y(0) = 0; while the exact solution is y(x) = x 2 . 

Take N — 2, M — 5 and IN = 7 (number of approximate parts for integrals 
using Romberg rule). Assume the approximate solution in the form: 

2 2 

y 2 (x) = Y^ C r T r (2x - 1) and y 2 {x) = ^ CrPr(2x - 1) 

r=0 r=0 

Apply algorithms ACP and ALP to find the approximate solution of above prob- 
lem by running the programs which written for this purpose in MatLab, table (1) 
present the values of C r 's, respectively. 
Thus, we get the following approximate formulas: 

y° pen (x) = -0.102e" 5 a; + 1.00000184x 2 

y ciosed^ = _ le -7 _ .l06e" 5 a; + 1.00000176x 2 

y^ eg (x) = -0.84e" 6 a; + 1.0000014a; 2 

Table (2) show the comparison between the exact solution y{x) and approximate 

solution 2/2 (#) f° r all open , closed Chebyshev and Legendre respectively depending 
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LSOR 
Method 


O7- s 


Co 


Ci 


c 2 


OHR 


0.37500018 


0.50000041 


0.12500023 


CHR 


0.37500012 


0.50000035 


0.12500022 


LER 


0.33333338 


0.50000028 


0.16666690 



Table 2 



X 

-Points 


Exact 
Solution 


Least-Square Orthogonal Method 


OHR 


CHR 


LER 





0.00 


0.0 


-9.99999 e-009 


0.0 


0.1 


0.01 


0.0099999164 


0.0099999016 


0.00999993 


0.2 


0.04 


0.0399998696 


0.0399998484 


0.039999888 


0.3 


0.09 


0.0899998596 


0.0899998304 


0.089999874 


0.4 


0.16 


0.1599998864 


0.1599998476 


0.159999888 


0.5 


0.25 


0.24999995 


0.2499999 


0.24999993 


0.6 


0.36 


0.3600000504 


0.3599999876 


0.36 


0.7 


0.49 


0.4900001876 


0.4900001104 


0.490000098 


0.8 


0.64 


0.6400003616 


0.6400002684 


0.640000224 


0.9 


0.81 


0.8100005724 


0.8100004616 


0.810000378 


1.0 


1.00 


1.00000082 


1.00000069 


1.00000056 


L.S.E 


0.122764 c-011 


.868310 c-012 


.567028 e-012 


R 2 = 


L.S.Ey 


0.231718 e-011 


.245802 e-011 


.449124 e-011 


R.Tir 


ne/Sec 


2.601344 


2.766926 


2.655042 



on the least square error and running time. The values residual equations R2(x; C) 
are also included by applying the formula (3.3). 

Note that, here if we take IN — 10 we obtain the exact solution y%{x) only for 
open and closed Chebyshev, which x 2 but not in Lcgcndrc used. 

Example 4.2. Consider the linear VIFDE on < x < 1 : 

C D 2 x a y{x) \ c D«y(x) + (1 + x 2 )y(x) = f(x) 

X 

+ J [xt c D 2a y(t) + (x 2 - tfD?y(t) + e x+t y(t)] dt 


where 



6 



f(x) = x 5 + x 3 - x 2 - 1 - 7e x - e 2x (x 3 - 3x 2 + 6x - 7) 



1- 



-x 3 x 3 - 2a - 



X 2 -l)x 3 - a -: 



6 



»6— a 



r(4-2a) V 5 -2a J T{4-a)\5-a J T(5 - a) 

with the initial conditions: if < a < 0.5; y(0) = —1 if 0.5 < a < 1; y(0) = —1 
and y (0) = 0. The exact solution of this problem is known y(x) = x — 1. 

we take two different values of fractional order a: 
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Table 3 



X 

-Points 


Exact 
Solution 


Least-Square Orthogonal Method 


OHR 


CHR 


LER 





-1 


-1.000000059 


-1.00000005 


-0.999999996 


0.1 


-0.999 


-0.999000050192 


-0.99900003984 


-0.99899999368 


0.2 


-0.992 


-0.992000034856 


-0.99200002392 


-0.99199998704 


0.3 


-0.973 


-0.973000017024 


-0.97300000608 


-0.97299997896 


0.4 


-0.936 


-0.936000000728 


-0.93599999016 


-0.93599997232 


0.5 


-0.875 


-0.87499999 


-0.87499998 


-0.87499997 


0.6 


-0.784 


-0.783999988872 


-0.78399997944 


-0.78399997488 


0.7 


-0.657 


-0.657000001376 


-0.65699999232 


-0.65699998984 


0.8 


-0.488 


-0.488000031544 


-0.48800002248 


-0.48800001776 


0.9 


-0.271 


-0.271000083408 


-0.27100007376 


-0.27100006152 


1.0 





-1.60999 e-007 


-1.499999 c-007 


-1.24000 c-007 


L.S.E 


0.416042 c-013 


.341208 c-013 


.225431 c-013 


R 3 = 


L.S.Ey 


0.308311 c-012 


.270284 c-012 


.163582 c-011 


R.Tir 


ne/Sec 


1.829995 


1.871905 


1.864961 



• For a = 0.6 , take TV = 3, M = 4 ,and IN = 5. So apply algorithms ACP and 
ALP to find the approximate solution and we obtain the following approximate 
formulas: 



Open 



(x) 



y Closed {x) 
Lea / \ 

% ( x ) = 



-1.000000059 + 0.42e" 7 a; + 0.528e~ 6 a; 2 + 0.999999328a; 3 
= -1.00000005 + 0.6e~ 7 .T + 0.48e~ 6 a; 2 + 0.99999936a; 3 
-0.999999996 - 0.8e- 8 a; + 0.36e~ 6 a; 2 + 0.99999952a; 3 
Table (3) list the results obtained by running the programs for comparison the 
approximate solutions of the above problem .Included arc the least square error 
and running time with the values of residual R^{x; C) for comparison. 

• For a = 0.4, take N = 3, M = 4, and IN — 5. Assume the approximation 
solution same as before, by running programs the result of least square orthogonal 
methods are obtained as follows:Thus, we get the following approximate formulas: 



Open / \ 

V 3 ( x ) 



,-7„ 



-~ 6 x 2 



y? osed (x) - 

Lea / \ 

y 3 ( x ) 



0.416e 



-v 



0.999999616a; 3 
-1.0000000055 - 0.4e- 7 x + 0.27e~ 6 a; 2 + 0.9999997a; 3 
Tabic (4) shows a comparison between the exact solution y(x) and approximate so- 
lutions j/3 (a;) for all open, closed Chcbyshcv and Lcgendre polynomials respectively. 
Note that for a = 0.6 and a = 0.4, if we take M = 5 and IN = 10 we obtain 
the exact solution for all types. 

Example 4.3. Consider a higher-order linear VIFDE's on bounded interval [0, 1]: 
C Dl- e y(x) l - c D]: 2 v(x)+x c D^y{x) + cosxy(x) = /(.*) 



6 



+ 



X 

/ [5xt c D° t - 8 y(t) + (x 2 +t 2 ) c D°- 3 y(t) - e 2 ^*y(i)] dt 
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X 

-Points 


Exact 
Solution 


Least-Square Orthogonal Method 


OHR 


CHR 


LER 





-1 


-1 


-0.999999998 


-1.000000005 


0.1 


-0.999 


-0.99900000512 


-0.999000005824 


-0.9990000066 


0.2 


-0.992 


-0.99200000576 


-0.992000007632 


-0.9920000046 


0.3 


-0.973 


-0.97300000384 


-0.973000005728 


-0.9730000008 


0.4 


-0.936 


-0.93600000128 


-0.936000002416 


-0.935999997 


0.5 


-0.875 


-0.875 


-0.875 


-0.874999995 


0.6 


-0.784 


-0.78400000192 


-0.784000000784 


-0.7839999966 


0.7 


-0.657 


-0.65700000896 


-0.657000007072 


-0.6570000036 


0.8 


-0.488 


-0.48800002304 


-0.488000021168 


-0.4880000178 


0.9 


-0.271 


-0.27100004608 


-0.271000045376 


-0.271000041 


1.0 





-7.999999 e-008 


-8.2000 e-008 


-7.50000 e-008 


L.S.E 


0.921395 c-014 


0.941651 c-014 


0.77717 e-014 


X?3 = L.S.Ey 


0.730238 c-013 


0.402558 c-013 


0.44232 c-013 


R.Time/Sec 


1.868506 


1.885245 


1.892130 



Table 5 



LSOR 

Method 


O7- s 


Co 


c\ 


c 2 


c 3 


OHR 


1.6874989 


0.53125069 


-0.18750006 


-0.3125 e-1 


CHR 


1.6874990 


0.53125055 


-0.18750009 


-0.3125 e-1 


LER 


1.7499998 


0.54999976 


-0.25000045 


-0.5000 e-1 



where 



Six) 



-12 



-1.8 



„l-4 



.1.5 



„3.5 



r(2i 



r(2.4) 



1 



x z 1 x 3 - 2 - 6 



r(i.5) 
/ 4.86 



r(3.5)' 



44.18 

r(3.2) r(5.2)~y~ " V r ( 4 - 7 ) r(6.7) 

2 1 ™.3\ „x oJ2x 1 { -t 1 o™ ^.3 



x 2 x 3 - 7 



+ (3 + 4a; + 3x 2 + x 3 ) e x - 3e 2x + (l + 2x - x 3 ) cosx 



with the initial conditions : y(0) = 1 , y (0) 
y(x) = 1 + 2x - x 3 . 



while the exact solution is 



Which is multi-higher order linear VIFDEs with variable coefficients. Take N = 
3 , M — 5 ,and IN = 8, Assume the approximate solution in the forms: 

• For Chebyshev polynomial yz{x) — X)r=o C r T r (2x — 1) 

• For Legender polynomial yz{x) = J2r=o C r P r {2x — 1) 

Run programs to find the approximate solution of the above problem, table(5) 
present the values C r 's , respectively . 

Thus, we get the following approximate formulas: 

y° pen (x) = 0.999998235 + 2.00000033a; + 0.36e~ 5 .T 2 - 1.00000272a; 3 

yCiosed^ = 0J99998448 + 2.000000236a; + 0.3504 e - 5 a; 2 - 1.000002816a; 3 

y^ e9 {x) = 0.999999772 + 2.000000036a; + 0.276e- 5 a; 2 - 1.00000364a; 3 
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Table 6 



X 

-Points 


Exact 
Solution 


Least-Square Orthogonal Method 


OHR 


CHR 


LER 





1 


0.999998235 


0.999998448 


0.999999772 


0.1 


1.199 


1.19899830128 


1.19899850382 


1.19899979956 


0.2 


1.392 


1.39199842324 


1.39199861283 


1.39199986048 


0.3 


1.573 


1.57299858456 


1.57299875813 


1.57299993292 


0.4 


1.736 


1.73599876892 


1.73599892282 


1.73599999504 


0.5 


1.875 


1.87499896 


1.87499909 


1.875000025 


0.6 


1.984 


1.98399914148 


1.98399924278 


1.98400000096 


0.7 


2.057 


2.05699929704 


2.05699936427 


2.05699990108 


0.8 


2.088 


2.08799941036 


2.08799943757 


2.08799970352 


0.9 


2.071 


2.07099946512 


2.07099944578 


2.07099938644 


1 


2 


1.999999445 


1.999999372 


1.999998928 


L.S.E 


0.152607 c-010 


0.120976 c-010 


0.174010 c-011 


i?3 = 


L.S.Ey 


0.140318 c-010 


0.319769 c-010 


0.199853 c-009 


R.Tir 


ne/Sec 


5.774807 


5.944966 


5.891868 



Table (6) present a comparison between the exact solution y(x) and numerical 
solution yz(x) for all open , closed Chebyshev and Legendre respectively depending 
on the least square error and running time. Furthermore, it included the values of 
residual equations R^(x;C) by applying formula (3.3). 

Note that for IN = 15 we can get the exact solution for all types. 

5. Discussion 

In this chapter, three numerical algorithms have been applied to solve the multi- 
higher order of linear VIFDEs with variable coefficients. For each algorithm, a 
computer program was written and several examples are included for illustration 
and good results are achieved. The least square error, least square error function 
(y) and running time are all also given in tabular forms. The following points have 
been identified. 

1: This method can be used even where there is no information about the 
exact solution (from the error function Rn(x;C) in equation (3.3)). 

2: The good results depend on: number of approximate parts of integral IN 
(if we take IN a large number we obtain the exact solution of dn(x)) 
and the number of the orthogonal polynomials N ,with suitable number of 
polynomial zeros M . 

3: By running the programs of ACR and ALR the least square error function 
in Chebychev polynomials gives more accurate solutions than Legendre 
polynomial method, see tables (2), (3), (4) and (6), so it is better. 

4: In general, the solution by Chebyshev polynomials is easier and faster than 
of Legendre polynomials. 
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COMPARING SOME ROBUST METHODS WITH OLS METHOD 
IN MULTIPLE REGRESSION WITH APPLICATION 

SIZAR ABED MOHAMMED 



Abstract. The classical method. Ordinary Least Squares (OLS) is used to 
estimate the parameters of the linear regression, when assumptions are avail- 
able, and its estimators have good properties, like unbiascdncss, minimum 
variance, consistency, and so on. The alternative statistical techniques have 
been developed to estimate the parameters, when the data is contaminated 
with outliers. These are the robust (or resistant) methods. In this paper, three 
of robust methods arc studied, which arc: Maximum likelihood type estimate 
M-estimator, Modified Maximum likelihood type estimate MM-cstimator and 
Least Trimmed Squares LTS-cstimator, and their results arc compared with 
OLS method. These methods applied to real data taken from the Charstin 
company for manufacturing furniture and wooden doors, the obtained results 
compared by using the criteria: Mean Squared Error (MSE), Mean Absolute 
Percentage Error (MAPE) and Mean Sum of Absolute Error (MSAE). The 
important conclusions that this study came up with are: the number of outlier 
values detected by using the four methods in the data for furniture's line are 
very close. This refers to the fact that the distribution of standard errors is 
close to the normal, but the outlier values found in the data for doors line, by 
using OLS arc less than which detected by robust methods. This means that 
the distribution of standard errors is departure distant from the normal. The 
other important conclusion is that estimated values of parameters by using 
OLS are very far from its estimated values by using the robust methods with 
respect to doors line, the LTS-cstimator gave better results by using MSE cri- 
terion, and M-cstimator gave better results by using MAPE criterion. Further 
more, it has noticed that by using the criterion MSAE, the MM-estimator is 
better. The programs S-plus (version 8.0, professional 2007), Minitab (version 
13.2) and SPSS (version 17) are used to analyze the data. 



1. Introduction 

Regression is one of the most commonly used statistical techniques. Out of many 
possible regression techniques, the Ordinary Least Squares OLS method has been 
generally adopted because of tradition and ease of computation. However, OLS 
estimation of regression weights in the multiple regression are affected by the oc- 
currence of outliers, non-normality, multicollinearity and missing data. Outliers 
are observations that appear inconsistent with the rest of the data. The influential 
points remain hidden to the user, because they do not always show up in the usual 
least square residual plots. To remedy this problem new statistical techniques have 
been developed that are not easily affected by outliers. There are many robust 
methods such as: Least Median Squares estimates LMS, Least Trimmed Squares 
estimates LTS, Least Winsor Squares estimates LWS, Rank statistics estimates 



Key words and phrases. Robust Regression, M-estimate, MM-cstimatc, LTS-estimatc. 
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R-estimate, Maximum likelihood typo estimate M-cstimatc, Laplace statistics esti- 
mates L-estimate, an adaptive maximum likelihood type estimates, Scale statistics 
estimates S- estimate, Modified Maximum likelihood type estimate MM- estimates 
and Generalized Maximum likelihood estimate GM- estimates. Through the im- 
portant observations of robustness, the indicated deletion seen in outlier in the 
data is not a solution and it became a topic of the past that can not be adopted. 
The main purpose of robust regression is to provide resistant parameters in the 
presence of outliers. In order to achieve this stability, robust regression limits the 
influence of outliers. In this paper three of robust methods are studied, M-estimatc, 
MM-estimate and LTS-cstimate, and their results are compared with OLS method. 
These methods are applied to the real data taken from the Charstin Company for 
the manufacture of furniture and wooden doors located in the city of Duhok. The 
aim of this paper is studying some robust methods of parameters estimation of 
linear regression in the presence of outliers in data, and comparing the results with 
OLS estimators 

2. The Basic Concepts 

2.1. Outliers. [4, 7, 21]. Any statistician, when analyzing a set of real data may 
encounter outlier values called observations that deviate significantly from any other 
observations, which are not consistent with other observations. Those values may 
be large or small and may result from an error in recording data or the preview 
or the appearance caused by other natural causes. There is no agreement on a 
specific definition for outlier value. However it is called contaminated, conflicting 
observation, discordant observation or irregular values. This concept has many 
definition. For example, it was defined by Grubbs, 1969 as the value that looks 
significantly deviant ( Deviate Markedly ) from the other simple observations, it 
arises from heavy-tailed distributions or mixed distributions. The presence of out- 
liers in a sample of several variables is more complex than a single variable case. 
The presence of outliers in the data leads to great confusion in the analysis of the 
data in the case of using traditional methods in the estimation. One of these meth- 
ods is the least square method. Hubcr P.J., 1973 showed the effect of outlier values 
in the estimation of least squares through his famous saying "the existence of an 
outlier value damages the good characteristics of the estimation of least squares it 
also leads to achieving success for LS method". 

2.2. Regression outliers. [1, 21, 23]. Regression outliers are those observations 
that do not trace the model and the outlier value results from Yi unusual or Xi 
unusual or both as: Rousseeuw (1987) made it obvious by examples of the impact 
of outlier in the estimation of the least squares and expounded how one observa- 
tion changes from the direction of the line of least squares. Chatterjee and Hadi, 
(1988) knew that the outliers of regression those observations that have a great 
standardized residual compared to the rest of the other observations. (Draper and 
Smith) knew that an outlier observation in linear regression is that observation in 
which the absolute value of residuals is much larger than the rest of the absolute 
values of the rest of the other observations and perhaps it may become away from 
the average value of those residuals 3cr by 4cr or more. 

2.3. Leverage points. [21]. They are those points which are located away from 
most of values in the matrix X in the form Y = X/3 + e which include one or more 
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of the explanatory variables in regression analysis and have a strong influence on 
the estimation of ordinary least squares. When observation X$ is an outlier point, 
we call an observation {Xk,Yk ) a leverage point whenever^ lies far away from the 
bulk of the observation Xi in the sample, note that this does take Yk into account, 
so the point (Xk,Yk ) does not necessarily to be a regression outlier, when (Xk,Yk ) 
lies close to the regression line determined by the majority of the data, then it can 
be considered a good leverage point. Therefore, to say that (Xk,Yk ) is a leverage 
point refers only to its potential for strongly affecting the regression coefficients , 
but it does not necessarily mean that (X/~,Yk ) will actually have a large influence 
on , because it may be perfectly on line with the trend set by the other data. In 
multiple regression, the (Xji,a€|,Xj p ) lie in a space with p dimension, a leverage 
point is then still defined as a point (Xki,&€\,Xk p ,Yk ) for which (Xki,&€\,Xk p ) is 
outlying with respect to the (X il ,a€|,X ip ) in the data set. As before, such leverage 
points have a potentially large influence on the LS regression coefficients, depending 
on the actual value of Yk . 

2.4. Breakdown points. [21]. A useful measure of robustness is the breakdown 
value, its oldest definition Hodges (1967) was restricted to one dimensional estima- 
tion of location, where as Hampel (1971) gave a much more general formulation. 
Unfortunately, the latter definition was asymptotic and rather mathematical in 
nature, which may have restricted its dissemination. This study used the simple 
finite-sample version of the breakdown point, introduced by Donoho and Huber 
(1983). Z = {(X 11 ,X 12 ,...,X lp ,Y 1 ),...,(X nl ,X n2 ,...,X np ,Y n )} and we want to 
estimate its for this purpose, we apply a Take any sample of n points in p dimen- 
sion: translation equivariant . This means that applying T to such a sample Z 

yields a vector! (3 , J3 1 , ....,(3 p ) of regression coefficientsT (Z) = J3 , now we consider 

all corrupted samples Z obtained by replacing any m of the original data points 
by arbitrary values, and we define the maximal bias by max bias (ra; T, Z) = 
Sup z > | \T (Zi) — T {Z)\\ , if bias (m; T, Z) is infinite, this means that m outliers can 
have an arbitrarily large effect on T, it follows that maxbias(rn;T, Z) — oo, hence 

T IZ ) becomes useless. Therefore, the finite sample breakdown point of the esti- 

{777 1 

— ; bias (m; T, Z) = oo > . 

Is again the smallest fraction of contamination that can cause T to take on values 
arbitrarily far away from T(Z). obviously, the multivariate arithmetic mean pos- 
sesses a breakdown point of — . We often consider the limiting breakdown point 

n 
for n — > oo by £* (T) , so we say that themultivariate mean has breakdown. It 

is clear that no translation equivariantT can have a breakdown point large than 
0.50. In order to be able to estimate the original parameters /3 , ..., (3 p we need that 
C < C* (T) . For this reason £* (T) is some times called the breakdown bound of 
T. For least squares, we have seen that one outlier is sufficient to carry T over 

all bounds. Therefore, its breakdown point equals £* = — which tends to zero for 

77 

increasing sample size n, so it can be said that LS has breakdown point of 0. This 
again reflects the extreme sensitivity of the LS method to outliers. 

2.5. Criteria for judging accuracy of estimation method. [10, 19]. In cases 
of appreciation of applied precision treated as a criterion to test the method of 
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estimation, there are several reasons that lead to inaccuracies example of this data 
or the use of non-sufficient procedure docs not match the data type. Accuracy is the 
most widely used criterion to evaluate the achievement of the estimation methods 
which reflect the health of predictable, but the difficulties attached to standard of 
accuracy in the estimate is the absence of a single measurement, acceptable, and the 
accuracy of the complex at the some time. There is a number of criteria commonly 
used by statisticians, including: 

2.5.1. Mean Squared Error (MSE). : The mean squared error is a measure of ac- 
curacy computed by squaring the individual error for each item in a data set and 
then finding the average or mean value of the sum of those squares. The mean 
squared error gives greater weight to large errors than to small errors because the 
errors are squared before being summed. It takes, the following formula: 

MSE = ^- -— - -^ 

n — p — 1 n — p — 1 

2.5.2. Mean Absolute Percentage Error (MAPE). : The mean absolute percentage 
error is the mean or average of the sum of all of the percentage errors for a given 
data set taken without regard to sign, i.e., (their absolute value is summed and the 
average computed). It takes the following formula: 

n 

E \PE t \ 
MAPE = — * 100 



; y - Yi 

where PEi 



Yi 

2.5.3. Mean Sum of Absolute Error (MSAE). : It is a common standard to differ- 
entiate between the method of least squares and the method of absolute error and 
it takes the following formula: 

n n 

EiYi-n eni 

= 1 i=l 



MSAE: 

n n 

It should be mentioned that minimizing the average absolute errors as to minimize 
the total absolute errors and this is sometimes called the Least Absolute Value 

(LAV). 

2.5.4. The Ordinary Least Squares method (OLS). :[14, 23]. The basic idea of the 
ordinary least squares method is to make the sum of squares of errors (SSe) as small 
as possible: 

n 

minS^ e-i, % — 1, 2, 3, ..., n 

i=\ 

Therefore Gauss preferred it to other methods, because of easy estimation of pa- 
rameters from data. He also assumed that e^ as a variable follows the normal 
distribution, so the method of OLS is practical idea, if all the assumptions related 
to it are achieved. If one or more of these assumptions are not achieved, this will 
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lead to getting inaccurate estimation of parameters j3 of the model and can not be 

adopted: 

e = Y -Xf3-> e = Y -Y 

The estimation of the parameters f3 that make SSe minimum as possible is called 
the least squares normal estimation Pols w hich produces a (p) solution of simul- 
taneous equations can be given as: 

Pols= ( xx ) X'Y,thmthefonnYi = o + p 1 X il + 2 Xi2 + ...+P p X ip + eiia 
called linear model of full rank regression models. The specific determinate second 

d 2 (ee'\ 

partial derivative of the vector f3 OLS will always be greater than zero | 2 — I > 



2.5.5. Assumptions of least squares method. :[15, 17, 22, 23]. There are a number 
of assumptions based on the study usual method of OLS in linear regression model: 
A- Assumptions about the error: 

1- Mathematical expectation of the random error is equal to zero E (e») = 0,i = 
1,2,..., n 

It follows that: E (Y t ) =E((3 + ii x X % + e t ) = (3 + ^X, + E (e t ) = (3 + ^X, 

2- Contrast the values of a random variable to be constant in each period of time 
Var(ei) = E (e?) = a 2 i = 1,2, ae\,n . This is called homoscedasticity of er- 
ror variation. It therefore follows that the variance of the response variable Y{ is: 
a 2 (Yi) = a 2 ,since a 2 (0 O + fi x Xi + e*) = a 2 (e 4 ) = a 2 . 

3-A random variable ei distributed as normal distribution with mean zero and vari- 
ance a 2 , a ~ N (0, a 2 ) ,i — 1, 2, de|, n 

4-If for any i ^ j , e,, ej are independent, then there is no autocorrelation between 
ei and ej , hence the outcome in any one trial has no effect on the error term for any 
other trial as to whether it is positive or negative or small or large since the error 
terms ei and ej are uncorrelated, this means that the covariance between them is 
equal to zero Cov (ej, ej) — 0, i ^ j — 1, 2, ..., n, 

5-Indcpcndence between the explanatory variables and random variables, E (e,, Xij) ■■ 
,that is to say e, independent of X^ for all different values of i, this means there 
is no problem of multicollinearity. 

B- Assumptions on the distribution of the response variable Y: 
1-Average Yi is a function of straight line Y — E (Yi) — f3 +j3 1 Xi+f3 2 X2 + ---+f3 p X p 

, i = 1,2, ...,n. where /3 , beta\, ..., beta p are estimates of regression parameters, 

since the regression function relates the means of the probability distribution ofY 

for any given X to the level of X. 

2- Variance of Yi ,Var(Yi) has one value for any value of i Var(Yi) — a 2 , i — 

l,2,...,n. 

3- Response variable Yi distributed as normal distribution with mean \x and variance 

a 2 ,i.e., Y-N^a 2 ). 

4- Any two observations Yi and Yj are uncorrelated, this implies that, Cov (Yi, Yj) — 
for all i ^ j = 1, 2, ..., n 

5- The relationship between X^s and Y be a linear relationship is the equation of 
straight line Y = E (Y { ) = O + /3 1 X 1 + $ 2 X 2 + ... + /3 p X p , i = 1, 2, ..., n . 
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2.5.6. Properties of ordinary least squares method. :[1, 2, 16, 23]. 
1-That (3 unbiased estimate of the parameter /3,E (ft) — f3 . 

2-Estimators have the least variation among all linear unbiased estimators com- 
pared variations of unbiased linear estimators that belong to any other ways, 
i.e., the best linear unbiased estimates. 3-The consistent destinies mean that 
thep r [a" ,$ OLS — 0a", < (] — 1 when n — > oo for each £ > or when $ OLS — (3. 
4-Sufficient destinies mean that the estimate f3 called the estimate enough for f3 if 
all information about the parameter of the community (3 is included in the sample 
data. 

5-Matrix of variance- covariance of the estimator J3 is: Var ( ft I = ( X X I a 2 . 

3. Some Robust Regression Methods 

3.1. Outliers detection. : There are many methods to detect the outliers in the 
data, which are the classic and robust methods; in this paper the robust methods 
are used. A-Rousseeuw and Van Zomeren [8, 9, 21]. proposed using robust methods 
to detect the outlier of linear regression as follows: 

1- Fit the regression model , then use the LMS estimation method for the detection 
of the outlier, then the residuals which are calculated for standard least median of 
squares estimates according to the following: 



SRi — t,*= 1, 2, ...,nwhere<j — K \ rnede 2 , 

(7 ' 

where K is arbitrary positive constant. 

2- Estimation method uses Minimum Volume Ellipsoid (MVE) estimator to find 
the lowest volume of the ellipsoid on the matrix regression such as Z if that X = 
f 1 : Z J .MVE estimator looking for smaller pieces containing half of the data 
which are usually calculated averages and covariance of the points inside the ellipse 
and then re-measure these estimates so that the estimated average community 
variability and the fact that when the joint sampling distribution of multivariate 
normal. 

3- Calculate the robust distance of all views as follows: 

RD(X t ) = yjiXi -T{Xi)j (C (Y)) 1 ( X i ~ T ( X i)), 

where T (Xi): estimate MVE vector of multivariate location parameters. C Q^) : 
PXP a square matrix represents the estimate MVE of the matrix variance-covariance 
of the multivariate. 

4- Use the graph to discover outliers and leverage points, as the following: 

Graph 3.1: The rule that could be adopted in the diagnosis process. 



2.5 


(A) Regression outlier 


(B) Bad leverage point 





(C) Good leverage point 


(D) Good leverage point 


-2.5 


(E) Regression outlier 


(F) Bad leverage point 



Graph 3.1. The rule that could be adopted in the diagnosis process. Graph 3.1. 
shows that the cut point for standardized LMS residuals is (A±2.5), the cut point 



of the robust distance RD (Xi) is WX( 1 _ QXiy) if RD (Xi) > WX( 1 _ QXiy) then X 
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outlier values. 

The graph 4.1. divided into six regions and views are classified as follows: a€^ 
Observation i is good leverage point if it takes place in C. a€^ Observation i is good 
leverage point if it takes place in D. a€^ Observation i is regression outliers if it 
takes place in A or E. a€^ Observation i is bad leverage point if it takes place in B 
or F. B- Rousseeuw and Leroy,1987 [21] used a method to detect outliers in multiple 
linear regression method as follows: 1-Apply one of robust methods. 2-Find the 
estimated values for the response variable. 

Y = 0o + Pi x a + 02 x a + ■■■ + P P x ip> i = 1, 2, ..., n 

3- Residuals which are calculated for standardLMS 1 according to the following: 

SR{ = -4- , i = 1, 2, ..., n where a = KWmedej 
a 
Graph 3.1. shows that the cut point for standardized LMS residuals is the 

(A±2.5), therefore the observation which lie outside this region is outlier. 



3.2. Robust estimation methods. [3, 5, 8, 11, 12, 13, 15, 18, 20, 21, 23]. One of 
the main concerns of knowledge of statistics is that analyzing data for the study or 
scientific research and interpretation of results get to interpret the data based on 
rules and modalities and in some cases the researcher faces a problem in data such 
as departure sample data distribution is assumed because of the outlier values or 
distribution community under study is supposed to distribution. This is because 
the deviations from the assumptions will come and put the traditional methods for 
estimating the parameters of the model. The researchers found that these methods 
are not efficient in the case of failure to achieve one of the assumptions or conditions 
upon which these methods work, so the researchers had to find more efficient meth- 
ods and not affected much deviations from specific assumptions. These methods 
are called robust methods. Although there are different ways of robust regression, 
but most of them share two basic points, one of them gives less weight to the view 
that the outlier found and the other is to use an iterative method. 



3.2.1. Maximum likelihood estimator (M- estimator): M-estimators are based on 
the idea of replacing the squared residuals ef used in least squared LS estimation, 
which minimize* X)"=i e f >by another function of residuals. 

That minimizes J^ ; _ 1 p (ej) where p is a symmetric function, i.e., p (— t) = p (t) for 
all t , with a unique minimum at zero. Differentiating this expression with respect 
to the regression coefficients /3„- yields, X^ i=1 ip {&%) Xi — , where ip is the derivative 

of p , i.e. tp (X, j3) — — - — , and Xi is the row vector of explanatory variables 

of the i-th observation Xi = (Xii,Xi2,ae\,X ip ), therefore Y^i=i V' ( e i) x i = is 
really a system of p-equations, the M -estimate is obtained by solving the system of 
p-equations, the solution of which is not always easy to find. In practice, Newton- 
Raphson and iteratively reweighted least squares are the two methods to solve 
the M-estimated nonlinear normal equations. Iteratively reweighted least squares 
express the normal equations as: X WX/3 — X WY where W is an nxn diagonal 
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matrix of weights: 




The initial vector of parameter estimates, (3 are typically obtained from OLS. 
Iteratively reweighted least squares updates these parameter estimates with j3 = 

I X WX ) X WY . However the solution of Y^i=i ip (e^) X t — is not equivalent 
with respect to a magnification of the Y"-axis. Therefore, one has to standardize 
the residuals by means of some estimate of a , yielding minimize'^2" =1 p ( ^r ) 

Where a must be estimated simultaneously motivated one possibility is to use the 
Median Absolute Deviation {MAD) scale estimator: a = cmed (|e^ — medri\) , 
where c = 1.4826 if Gaussian noise is assumed, or a — 2.1med\ei\ or a — 1.5med\ei\ 
Huber proposed to use the function ip (t) — min(c,max (t,—c)) where c is some 
constant, usually around 1.5, as a consequence. However, the breakdown point of 
M-estimator is equal to zero. 

3.2.2. Modified Maximum likelihood type estimate (MM- estimator) : It is a class of 
robust estimators for the linear model, MM-estimation is a special type of M- 
estimation developed by Yohai V.J. (1987). MM-estimation is a combination of 
high breakdown value estimation and efficient estimation Yohai's MM-estimators 
have three stage procedures. 

1- The first stage is calculating an S-estimate with influence function. 

/ 3(-) 2 -3(-) 4 + (-) 6 if\x\<c, 
p{x) — < c c c 

[ 1 otherwise. 

The value of tuning constant c, is selected as 1.548 Where .S-estimate is a high 
breakdown value of robust regression methods introduced by Rousseeuw and Yohai 
(1984) that minimizes the dispersion of the residuals. The objective function is min 
s (ei (/3) , ei ((3) , ..., e n (/?)) where e^ (/3) ,is the i-th residuals for candidate j3. This 

1 n { Yi —Yi\ 

objective function is given by the solution, Y2"—i X — = k where k 

n — p \ s I 

is a constant. 

2- The second stage calculates the MM parameters that provide the minimum 

value of Y^ii=i P ~ ^ where k is the influence function used in the first 

A a ° J 

stage with tuning constant 4.687 and do is the estimate of scale from the first step 
(standard deviation of the residuals) . 3- The final step compute the MM-estimate 

of scale as a solution to 5^1-1 P ~ ~ — 0-5 

n — p \ s I 

3.2.3. Least Trimmed Square Estimator (LTS-estimator) : Least trimmed square es- 
timation LTS is statistical technique for estimation of the unknown parameters of a 
linear regression model and provides a robust alternative to the classical regression 
methods based on minimizing the sum of squared residuals. The least trimmed 
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squares estimator f3 LTS is defined as: j3 LTS = mirig ^2 i=1 ef , the trimming con- 
stant h has to satisfy — < h < n . Where e\ < e\ < ... < e 2 n represents the i-th 

order statistics of square residuals ef = [Yi — X i f3\ ,i ~ 1, 2, 3, ..., n , the break- 

ti — h ~ 

down point value is for theLTS estimator. To find the estimator j3 LTS it be 

taken into consideration n — h + 1 from the following subsample 

{Ai, A 2 , ...,Xh} 

{X 2 ,X 3 ,...,X h + l} 

{A3, A 4 , ...,X h + 2} 

{X n -h+i, X n -h + 2, ..., X n }. 

All sub sample comprises on h elements, it called the Contagious Half. Then, it 
computes the means for all subsample as following: 

1 k 
* = ,I> 

i—1 
-, fe+1 

i=2 



X n - 



n-fc+1 



n 

i=n — k-\-l 



And computing the sum of squares for all subsample 

h 

SQ 1 = V^-Ai) 2 



2 

2J 



SQ 2 = ^(Ai-X 



n 



SQn-h+l — 2_^ (Aj — A„ 

«— rt— fr+1 



/ y (Aj — A„_^ + ij 



Then the least trimmed squares estimator Plts wu ^ be equal to the mean which 
corresponds the smallest sum square to equation. 

4. The Application Part 

4.1. Data collection: Data was taken from Charstin Company for manufacturing 

furniture and wooden doors which located in the city of Duhok. It was established 

in 2004, where the variables included for each line according to months of the years 

for the period 2004 To 2010, as follows: 

1- Values of the monthly production as the dependent variable (Y). 

2-Costs of electricity and water, as a first explanatory variable(A!). 

3-Cars expenses and maintenance of machinery as a second explanatory variable 
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(X 2 ) 



4- Communication expenses as a third explanatory variable (A3). 
The unit of measurement of all the variables above is the U.S. dollar. 

4.2. Multiple linear regression model: The following equation represents the 
regression model for each production line in the company: 



(4.1) 



Y % = /3 + ftl ;i + /3 2 X t2 + /3 3 X i3 + ei 



Where: Yf. The value of i-th observation of the response variable (production). 
Xn: The value of i-th observation of the first explanatory variable (costs of elec- 
tricity and water). X i2 : The value of i-th observation of the second explanatory 
variable (cars maintenance and machines expenses). X^: The value of i-th obser- 
vation of the third explanatory variable (communication expenses). By using OLS 
method with software Minitab version (13.2), the linear regression model fitted for 
each production line, as the following: 
Furniture's line model: 



(4.2) % = 

Door's line model: 



-7283 + 36.3X 4 i + 16.372X i2 + 17.532X, 



;.;>> 



(4.3) 



Y % = -191597 + 1034.9Xa - 1178.6X i2 + 679.5X 



,:', 



The results of the regression analysis represented in the tables 4.1,4.2,4.3 and 
4.4 as follows: 

Table 4.1: The regression coefficients for furniture's line by using OLS. 



predictor 


coefficients 


Standard error of coefficients 


t. value 


Constant 


-7283.000 


1490.000 


-4.888 


x l 


36.300 


5.468 


6.639 


x 2 


16.372 


5.176 


3.163 


X, 


17.532 


2.587 


6.777 



#=5722, i?-square=88.4 
Table 4.2: Analysis of variance (ANOVA) table for furniture's line by using OLS. 



source 



Regression 



Residuals 



Total 



DF 



80 



83 



SS 



19882293231 



2619662871 



22501956102 



MS 



6627431077.000 



32745785.890 



202.390 



Tabic 4.3: The regression coefficients for door's line by using OLS. 



predictor 


coefficients 


Standard error of coefficients 


t. value 


Constant 


-191597.000 


63870.000 


-3.014 


X 


1034.900 


204.000 


5.073 


x 2 


-1178.600 


269.300 


-4.377 


x 3 


679.500 


238.900 


2.844 



5=196086 i?-square=69.8 

Table 4.4: Analysis of variance (ANOVA) table for door's line by using OLS. 
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source 


DF 


SS 


MS 


F 


Regression 


3 


7.125224E+12 


2.37508E+12 


61.770 


Residuals 


80 


3.07599E+12 


3.84499E+10 




Total 


83 


1.02012E+13 







From tables 4.1 and 4.3, It can be concluded that all parameters of the model 
for both lines are significant, through out noticing the absolute t- values > 2 or the 
p-values < 0.05. And from ANOVA tables 4.2 and 4.4, It is noticed that F-value is 
significant for both lines, since the value of calculated F greater than the tabulated 
value under significant level a — 0.05 and degrees of freedoms d/i=3, d/2=80, which 
is F(0.05,3,80)= 2.72 , or the p- value is < 0.05. Thus the fitted models 4.2 and 4.3 
are appropriate for the data of each line. 



4.3. Detection of outlier values: Outlier values can be detected in the data by 
drawing the graphs of standardized residuals, resulting from using each method of 
estimation OLS, M, MM, LTS mentioned above. The graphs performed by repre- 
senting the estimated values of the response variables (fitted values) on the hori- 
zontal axis (X-axis) , and the values of standardized residuals on the vertical axis 
(F-axis), the values that fall outside the cut-off points (A±2.5) considered as out- 
liers. 



4.3.1. Ordinary Least Squares OLS method: 

Table 4.5: Outlier values and its estimated values with standardized residual 
values for furniture's line by using OLS. 



No. 


Year 


Month 


Observed value Y 


Estimated valucYoLS 


14 


2005 


2 


14231 


28828.7 


19 


2005 


7 


10048 


25421.7 


67 


2009 


7 


89282 


73837.2 


79 


2010 


7 


51310 


30385.5 



Table 4.6: Outlier values and its estimated values with standardized residual 
values for door's line by using OLS. 



No. 


Year 


Month 


Observed value Y 


Estimated valueYoi,S 


46 


2007 


10 


741930 


231489 


47 


2007 


11 


1110620 


602866 


53 


2008 


5 


12633 


573055 



4.3.2. M-estimation: By applying the M-estimator robust method, by using SPSS 
(version 17) , S-plus (version 8.0, 2007 professional), and Minitab (version 13.2), 
the fitted linear model for each production line, is as the following: 
Furniture's line model: 



(4.4) Yi = -6031.1450 + 34.8230Yi + 11.8254A 2 + 18.3600A 3 

Door's line model: 



(4.5) 



Yi = 2887.2143 + 16.9561JG + 3.8971A 2 + 1.0568A 3 
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The fitted models in 4.4 and 4.5, used to generate the residuals by M-estimator 
method, then the standardized residuals compute. 



Table 4.7: Outlier values and its estimated values with standardized residual 
values for furniture's line by using M-estimator. 



No. 


Year 


Month 


Observed value Y 


Estimated valueYj^ 


19 


2005 


7 


10048 


23015.6 


54 


2008 


6 


36715 


20547.9 


67 


2009 


7 


89282 


68579.5 


79 


2010 


7 


51310 


27804.3 



Table 4.8: values and its estimated values with standardized residual values for 
door's line by using M-estimator. 



No. 


Year 


Month 


Observed value Y 


Estimated valueY/vf 


40 


2007 


4 


1352330 


23119.4 


41 


2007 


5 


1040410 


19278.6 


42 


2007 


6 


1160110 


20921.5 


43 


2007 


7 


1475250 


26699.3 


44 


2007 


8 


1129590 


20899.2 


45 


2007 


9 


1229060 


21663.0 


47 


2007 


11 


1110620 


18385.2 



4.3.3. MM- estimation: By applying the MM-estimator robust method, by using 
SPSS (version 17) ^-plus (version 8.0, 2007 professional), and Minitab (version 
13.2), the fitted linear model for each production line is as the following: 

Furniture's line model: 



(4.6) Y t = -6312.1664 + 36.7890Yi + 12.0964Y 2 + 19.4700Y 3 

Door's line model: 



(4.7) 



Yi = 2894.1356 + 17.4564Yx + 4.0552Y 2 + 1.1582X, 



The fitted models in 4.6 and 4.7, used to generate the residuals by MM-estimator 
method, then the standardized residuals computed. 



Table 4.9: Outlier values and its estimated values with standardized residual 
values for furniture's line by using MM-estimator. 



No. 


Year 


Month 


Observed value Y 


Estimated valucYjvfM 


19 


2005 


7 


10048 


24114.33 


67 


2009 


7 


89282 


72091.02 


79 


2010 


7 


51310 


29173.59 



Table 4.10: Outlier values and its estimated values with standardized residual 
values for door's line by using MM-estimator. 
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No. 


Year 


Month 


Observed value Y 


Estimated valueYMM 


40 


2007 


4 


1352330 


23950.94 


41 


2007 


5 


1040410 


19351.98 


42 


2007 


6 


1160110 


21197.29 


43 


2007 


7 


1475250 


27606.55 


44 


2007 


8 


1129590 


23018.05 


45 


2007 


9 


1229060 


23058.02 


47 


2007 


11 


1110620 


19815.32 



4.3.4. LTS estimation: By applying the LTS-estimator robust method, by using 
SPSS (version 17), 5-plus (version 8.0, 2007 professional), and Minitab (version 
13.2), the fitted linear model for each production line is as the following: 

Furniture's line model: 

(4.8) Y t = -6809.2556 + 36.9543^! + 12.4873Y 2 + 19.4401X 3 

Door's line model: 



(4.9) 



Y t = 1883.232 + 18.580Xi + 12.362Y 2 - 2.230Y 3 



The fitted models in 4.8 and 4.9, used to generate the residuals by LTS'-estimator 
method, then the standardized residuals computed. 

Table 4.11: Outlier values and its estimated values with standardized residual 
values for furniture's line by using TTS-estimator. 



No. 


Year 


Month 


Observed value Y 


Estimated valueYiTS 


19 


2005 


7 


10048 


23960.17 


54 


2008 


6 


36715 


21341.25 


67 


2009 


7 


89282 


72242.47 


79 


2010 


7 


51310 


29040.33 



Table 4.12: Outlier values and its estimated values with standardized residual 
values for door's line by using LTS'-estimator. 



No. 


Year 


Month 


Observed value Y 


Estimated valueY^s 


40 


2007 


4 


1352330 


23950.8 


41 


2007 


5 


1040410 


19351.8 


42 


2007 


6 


1160110 


21197.1 


43 


2007 


7 


1475250 


27606.3 


44 


2007 


8 


1129590 


23017.9 


45 


2007 


9 


1229060 


23057.8 


47 


2007 


11 


1110620 


19815.2 



The number of detected outliers by using four methods for both production lines 
can be summarized in the following table: 

Table 4.13: Detection of outliers by different methods for both production lines 



Production line method 


OLS 


M 


MM 


LTS 


Furniture 


4 


4 


3 


4 


Doors 


3 


7 


7 


7 
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4.4. Comparison of the results of OLS method and robust methods M- 
estimator, MM-estimator and iTS'-estimator: After estimation the coeffi- 
cients of multiple linear equation with four methods, the results can be compared 
by using the criteria mentioned in (chapter two) of theoretical part of the paper, 
in order to judge the estimators, the following tables show the estimated values 
of coefficients by using these methods, and values of comparative criteria for each 
production line. 



Table 4.14: The estimated values of coefficients by using four methods for 

furniture's line 



Variables 


Coefficients 


OLS 


M 


MM 


LTS 


Constant 


/3 


-7283.000 


-6031.145 


-6312.166 


-6809.257 


*i 


Pi 


36.300 


34.823 


36.789 


36.954 


x 2 


02 


16.372 


11.825 


12.096 


12.487 


x 3 


03 


17.532 


18.360 


19.470 


19.440 


MSE 




33160290 


37280005 


34141334 


34260526 


MAPE 




1.0495 


0.9826 


1.0312 


1.0034 


MSAE 




3961.3100 


3975.4900 


3896.2500 


3894.5200 



Table 4.15: The estimated values of coefficients by using four methods for door's 
line. 



Variables 


Coefficients 


OLS 


M 


MM 


LTS 


Constant 


00 


-191597.000 


2887.214 


2894.136 


1883.232 


Xi 


0i 


1034.900 


16.956 


17.546 


18.580 


x 2 


02 


-1178.600 


3.897 


4.055 


12.362 


x 3 


03 


679.500 


1.057 


1.158 


-2.230 


MSE 




1.4210E+11 


1.41584E+11 


1.41420E+11 


1.41389E+11 


MAPE 




23.3736 


0.2127 


0.2134 


1.1806 


MSAE 




284510 


118247 


118150 


120928 



5. Conclusions and Recommendations 

5.1. Conclusions: According to the results of the practical application of the 
study the following conclusions are yielded: 

1- The number of outlier values detected by using the four methods in the data 
for furniture's line is very close, by observing table 4.13, and the proportion of 
outlier values is 4.77. This refers to the fact that the distribution of standard errors 
is close to the normal distribution as shown in graph 4.1. 

2- From the table 4.13, it is noticed that the number of outlier values found in 
the data for door's line, by using OLS is 3 values, and the number of outlier values 
detected using the robust methods equals to 7 values and its proportion is 8.33, 
this means that the distribution of standard errors is somewhat distant from the 
normal distribution as shown in graph 4.3. 

3- The appearance of the negative sign for j3 in the table 4.14 is logical, this 
means that the absence of the explanatory variables (stopping production) for fur- 
niture's line. This mean there would be a loss. 
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4- It is noticed from table 4.14, that the estimated values for each parameter 
using the four methods are very close, and the reason is that the number of outlier 
values in the data is a few, as noted in the first conclusion above. 

5- The appearance of the positive sign for /3 in the table 4.15, by using the 
robust methods is illogical, the researcher believes that the reason is due to the fact 
that the data for doors line are inaccurate values. 

6- It is noticed from the table 4.15, that the estimated values using OLS are very 
far from its estimated values by using the robust methods, and that is a logical thing, 
because of the large number of outlier values discovered in the data for door's line, 
and it is also noticed that theLTS method gave better results by using the crite- 
rion MSE, and theM method gave better results by using the critcrionMAP_E, 
the criterion value is very close to its value by using MM method, it is also noticed 
that by using the criterion MSAE, the MM method is better. 



5.2. Recommendations: According to the results obtained in the practical part, 
the researcher recommends are as follow: 

1- Studying all robust methods commonly used, and applying them to a large 
number of data, in order to clarify the efficiency of each method. 

2- Using simulation procedure to generate data, and then contaminating them 
with the outlier values, at different percentage, and applying robust methods com- 
monly used to them, and then comparing the results of these methods. 
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HYBRID METHODS FOR SOLVING VOLTERRA INTEGRAL 

EQUATIONS 

GALINA MEHDIYEVA, MEHRIBAN IMANOVA, AND VAGIF IBRAHIMOV 



Abstract. It is known that there exists a class of methods for solving integral 
equations with variable boundary. One of them is the most popular methods 
of quadratures. This method is clarified and modified by many scientists. 

Here, to numerically solving Volterra integral equations the hybrid method 
is applied and constructed a concrete method with the degree p = 6 and p = 8, 
by using information about the solution of integral equations only one previous 
point. 



1. Introduction 

Consider the numerical solution of the following nonlinear Volterra integral equa- 
tion: 

X 

y(x) = f(x)+ K(x,s,y(s))ds, x€[x ,X]. (1) 

Xo 

Assume that the problem (1) has a unique continuous solution y(x) determined 
on the interval [xq, X]. By means of a constant step < h the interval [xo, X] divide 
into iV equal parts with the mesh points x% = Xq + ih (i = 0, 1, . . . , N). Denote 
by yi the approximate and by y{xi) exact value of the solution of problem (1) at 
the points Xi (i = 0, 1, 2, ..., TV). There are many papers dedicated to investigation 
numerical methods for solving integral equations (1) (see f.e. [l]-[6]). 

The first hybrid method for solving integral equation (1) constructed Makroglou 
[7], but that in [8] is generalized in the following form: 

k k k j — Vi 

^2a,y n+l = ^Olifn+i + h^ EI Pi K ( x n+j, x n+h,y n +h) 

i=a i=o j=a i=o 

(!i = i + vi, H <i). (2) 

Method suggested here has the following form: 

k k k j 

^onvn+i = ^a t /„ +1 + ^E^' K(%n+j,x„ +i ,y„ +i )+ 

i=0 i=0 j=0 i=0 

k j-Vi 

+ h Yl EI i { i ] K (. X n+j,Xn+U,y n +h)- (3) 

j=0 s=0 



Key words and phrases. Volterra integral equations, hybrid methods, degree of hybrid methods. 
This research has been supported by the Science Development Foundation of Azerbaijan (EIF- 
2011-l(3)-82/27/l). 
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2. Constructing a hybrid method of the type (3) 

It is known, that hybrid methods such as (3) can be constructed by different 
ways. Here the calculation of the next function 

X 

tp(x) = / K(x,s,y(s))ds 

X 

on the interval [xoj-X] by the some structures are reduced to the computation of 
functionsy)(a;) on the interval [x — kh, x] (here k - limited quantity). To this end, 
consider the following difference: 

Xn 

y{x n +\) - y(x n ) = g(x n +i) - g(x n ) + / (K(x n+1 , s, y(s)) - K(x n , s, y(s)))ds+ 

Xo 

x n +l 

+ / K(x n+1 , s, y(s))ds. (4) 

x n 

By the application of Lagrange's theorem we can write: 

K(x n+1 , s, y(s)) - K(x n , s, y(s)) = hK' x (£, s, y(s)) (x n < £„ < x n + h). 
Then, by using the received equality in (4) we have: 

Xn 

y(x n+ i) - y{x n ) = g(x n+1 ) - g(x n ) + h K' x (£ n , s, y(s))ds+ 

Xo 

x n+ i 

+ / K(x n +i, s, y(s))ds. (5) 

X n 

We assume that by any method we have found the solution of equation (1). 
Then, taking into account its in (1) we can write the following: 



y' 0*0 = g'( x ) + K ( x > x > 2/0*0) + / K L(x, s > y( s )) ds - 

Xo 

It is easy to show that 

X n 

hJK' x (Z n , s, y(s))ds = h(y'(U-9'(U - K(t n , £„, y(U))- 

Xo 

in 

-h J K' x {i n , s, y(s))ds. (6) 

Xn 

To take into account the relation (6) and the following 

hK 'x(t,m s ' y( s )) = K ( x n+i, s, y(s)) - K(x n , s, y(s)) 
in equation (1), we have: 
y(x n+ i) = y(x n ) + g(x n+1 ) - g{x n ) + %'(£„) - g'(£ n )) - hK(£ n , £ n , y(£ n ))+ 
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+ / K(x n , s, y(s))ds + / K(x n+1 , s, y(s))ds. (7) 

Thus, determined the solution of equation (1) on the segment[xo, cc re +i] to reduce 
it's determined on the segment [x n , x n + h\. Using one of the known formulas for 
calculating integrals involved in equation (7) and using the following equality (see 
f.e. [9, p.325]): 

k 
Z(x„+l) - Z(X„) - hz'(£ n ) = ^ a i z n+l 

i=0 

in equality (7) we have: 

k k 

^ai|/ n +j = ^.a-ign+i - hK(x n+ i, x n+h y n +i) + 

i=0 i=0 

k k 

+h^2^2p ( i 3) K(x n+jl x n+ll y n+i ). (8) 

Here a,, pf (j, i = 0, 1, 2, ..., k) are some real numbers, the parameter / satisfies 
the condition < I < 1, and 

g m = g(x m ) (m = 0, 1, 2, ...). 
However, by the help of next relation (sec. [9, p. 375]) 

k 
K(x n+ i, x n+ i, y n+ i) = '^ J /3 i K(x n+ i, x n+i , y n +i), 

i=0 

method (8) can be written in the form: 

k k k k 

^aiyn+i ^^a l g n+l + h' s ^2^2l3X ) K{x n+ j, x n+l , y n+i ). (9) 

j=0 i=0 j=0 s=0 

To calculating the integrals involved in (7), we can apply different formulas. If 
applied to the calculation of integrals the quadrature methods with the fractional 
steps, then the formula (9) can be rewritten as: 

k k k k 

^2a t y n+i =^2aig n+ i + h^2^2/3y'K(x n+j , x n+i+Vi , y n+i+Ui ), (10) 

i=0 i=0 j=0 i=0 

here the coefficients on, pf (i, j = 0, 1, 2,..., k) - are some real numbers, and 
\vi\ < 1 (i = 0,l,...,k). 

Note that in (8) is involved as a member K(x n +i, z n +i, y n +i) i\l\ < 1), which 
is different from the terms of the sum in (10), in that here in the first argument 
used by the so-called fractional step. However, using the Lagrange interpolation 
formula can get rid of fractional steps. But if we to take into account the specified 
property of one term of the method (8) and generalize it, then the method (10) can 
be rewritten as: 

k k k k 

^2ony n+l = ^ongn+i + h^2^2p 1< f ) K(x n+:j+l:jl x n+i+Vi , y n+i+Vi ), (11) 

2=0 i=0 j=0 i=0 
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where \lj\ < 1 (j = 0, 1, ..., k). If set lj = (j = 0, 1, ..., k), then from the relation 
(11) be it follows the method (10). It is easy to determine if generalize the methods 
(10) and (11), in one embodiment receive the following method: 

^ aiVn+i = ^ aqgn+i + h y~] y~] PX'K(x n+ j , x n+i , y n +i)+ 

i=0 i=0 j=0 i=0 

fe fc 

+^EE 7 i ■ K '( a; n+J+^' Zn+»+i/ 4 , Vn+i+vt) ■ (12) 

j=0 »=0 

The coefficients of these methods determined by the same scheme for lj = Vj 
(j = 0, 1, ..., k). Before we propose a method for determining the coefficients 
a ij Pi i li (h j — 0, 1, 2, ..., k), we consider some bounded on the coefficients of 
the method (12) and set U = Vi (i = 0, 1, ..., k). This is connected to the fact 
that, the method for determining the coefficients (11), we use some facts from the 
theory of difference equations and apply the following finite-difference method with 
constant coefficients to the solving of equation (1): 

k k k 

Y^ CXiZn+i =h^ PiZ'n+i + h Yl li z 'n+i+v z ■ (13) 

i=0 j=0 i=0 

We show that after applying the method (13) to the solving equation (1) we 
receive be the method of the type (12) and in this case between the coefficients of 
the methods (12) and (13) will be the next relations: 

k k 

|8i = E^' ) ; 7i = E^ } (* = 0,1,2,...,*;). (14) 

3=0 j=0 

Now, suppose that the method (13) is converges. Then its coefficients satisfy the 
following conditions: 

A: The values of the quantities a,, PlJ^i = 0, 1, 2, ..., k) are real numbers, more- 
over, at 7^ . 

B: Characteristic polynomials 

k k k 

p(A) = 5>* A< ' <7(A)^/3 4 V, 7(A) = E^ A<+,< ; 

i=0 i=0 i=0 

have no common multipliers different from the constant. 

C: cr(l) + 7(1) ^ andp > 1 are holds. 

To determine the values of the coefficients Qj, j3 it 7^ (i = 0, 1, ..., k) using the 
method of undetermined coefficients, and to this end, consider the following Taylor 
expansion of functions: 

y(x + ih) = y(x) + ihy'(x) + ^V(z) + ... + ^V%) + 0(^ +1 ), (15) 

y'(x + hh) = y'(x) + hhy"(x) + ^V'(z) + ... + S^2/ (j %) + 0{h"), (16) 
where x — x + nh - is fixed point, but l { = i + v^ (i = 0, 1, 2, ..., k). 
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For determining the values of the parameters a,, ft, U (i — 0, 1, 2, ..., k, U = 
i + Vj), take into account equalities (15) and (16) in the next asymptotic equality 

k 

V" {otiy(x + ih) - hf3 i y / (x + ih) - h-y^^x + l { h) = 0(h p+1 ), h -> 0, 

i=0 

here a; = x + nh the fixed point and p is the degree of the method (13). 
Then have: 

5>=0; ]T £«,= £ T^uA + E #^,^ (" = 1,2,...,P, 0! = 1). 

i=0 i=0 ' i=0 V '' i=0 V '■ 

(17) 
Thus, for determining the values of the on, ft, k (i = 0, 1,2, ..., k) we get a 
homogeneous system of nonlinear algebraic equation where the amount of unknowns 
equals 3fc + 3, but the amount of equation^ + 1. Obviously system (17) always has 
a trivial solution. However the trivial solution of system (17) is not of interest. 
Therefore we consider the case when system (17) to have a non-zero solution. It is 
known, that the system (17) has the non zero solution if the condition 

p<4fc+3 (18) 

is holds. Hence it follows that p max = 4fc + 2, but for ft = 0(i = 0, 1, ..., k) is holds, 
that Pmax = 3fc + 1. 

Now to consider construct a concrete method for k = 2 for ft = (i = 0, 1, 2). 
Then from system (17) we have: 

72+7i +7o = 2 "2 +ai, 

^272 + ^i7i + ^o7o = 2^ 2a2 + ai )' 

^27 2 + ^i7i + ^o7o = gl 23 ^ + «i), 

"27 2 + "?7i + "o7o = ^(2%+ai), (19) 

^27 2 + ^i7i + ^o7o = ^(2% + «i), 
5 

^272 + "?7i + ^o7o = g(2% + ai). 

Selecting a2, ai and «o we can construct different methods with the degree p = 6. 
For a 2 = 1) oi\ = ao = —1/2 solving system (19) we get a complex solution, but 
for «2 = 1, Qfi = — 1 and ao = the solution of system (19) has the following form: 

ft = 5/18, ft =8/18, 0o = 5/18, 



h = 3/2 + V15/10, ?i=3/2, Z = 3/2 - V15/10. 
Taking into account these values in formula (13) we get the following method: 

Vn+2 = Vn+1 + HW n+3/2+VIE/10 + 8y'n+3/2 + 5 ^+3/2-7l5/lo)/ 18 - ( 20 ) 

Remark, that these method can be rewrite as one step method in the next form: 

Vn+1 = Vn + HW n+1/2+VTE/10 + 8y'n+l/2 + 5 ^+l/2-VT5/lo)/ 18 - 

This method is stable and has the degree p — 6. 
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3. Algorithm for using hybrid methods 

Construct an algorithm for using method (20). Assume that j/j is known, and 
consider the calculation of y n +2 {n = 0,1,2,...). For applying method (20) the 
values of the quantities y n ± a , should be known and they are determined by means 
of the following formula: 

y n+a =y n + ^hy' n + a 2 h((h 2 - 12a + 6)y , n+3/2 - (3a 2 - 48a + 27)y , n+1 + 

+ (3a 2 - 60a + M)y' n+1/2 - (a 2 - 24a + 33)i4)/18, (21) 

where a = 3/2 ± >/l5/10. 

Thus, we see that for using method (21), we must construct formulae for calcu- 
lating the values of y n+ i, y n +i, 2/ ra +a with accuracy 0(h 5 ). If j/iare know, then 
for n > 1 it is enough to calculate the values of y n +\ and y n+ 3 . Therefore, at first 
consider the definition of the value of the quantity y n +i and to this end we use the 
following sequential formulae: 

1- jjn+i =y n + hy' n , 

2. y-n+i =y n + h(y' n + y' n+1 )/2, 

3- fin+l =y n + KVn + A y' n +\ + £™+l)/ 6 > 

4. y n +i = y n + h(y' n + 4:y' n+ i + y' n +i)/§, 

5. y n+ i = y n+ i + h(7y' n+1 - 2y , n+ i + y' n )/6, 

6. y n+ i = y n+1 + h(9y' n+i + lV n +i - $y' n+ i + l/ n )/48. 

Conclusion 1. Given that stable hybrid methods are more accurate and have ad- 
vanced the stability region, here for numerical solution of equation (1) suggested 
a method that makes use of hybrid methods. Note that the methods of type (12) 
belong to the class of methods of type (13). To constructed type methods of type 
(12) we must to solve system (14), whose solution is not unique. However, the 
solution of (17) may be unique. This means that when the method of type (13) is 
unique corresponding method of type (12) is not unique. Therefore, both theoreti- 
cal and practical interest is the finding maximum value of degree of stable method 
of type (12). Also of interest to determine the stability region of method (12). 
Consequently, the investigation of method (12) is a priority in the theory of numer- 
ical methods. Remark, that the method (20) has the new characteristic. The first 
of them including in used method (20) as one step method, but in these case for 
the determine quantities y„ + i/2±yr5/io by the method (21) the values of quantities 
Vn+3/2> Vn+i JTiust be known. Because here suggested the method as the (20), but 
the next characteristic of the method (20) including in definition the method (20) 
as the explicit, but the method (21) for any values of parameter a and the method 
applying for finding y n+ i andy n+3 / 2 are implicit. Consequently algorithm for using 
method (20) is implicit. 
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ON THE NUMBER OF REPRESENTATIONS OF AN INTEGER 
OF THE FORM x 2 + dy 2 IN A NUMBER FIELD 

SARAT SINLAPAVONGSA AND AJCHARA HARNCHOOWONG 



Abstract. This paper tells us about the number of representations of an 
algebraic integer of the form x 2 + dy 2 in a number field where d is a positive 
rational integer. 



1. Introduction 

There are many papers which give the criteria to determined whether an alge- 
braic integer can be represented in the form x 2 +dy 2 such as [1], [2], [3], [4]. Another 
interesting problem about integers of the form x 2 + dy 2 is to study the number of 
these representations. T. Nagell [5] study the problem of the number of represen- 
tations of an integer which can be represented as the sums of two squares. For this 
paper we generalize the result of T. Nagell to the representations of an algebraic 
integer in a number field of the form x 2 + dy 2 where d is a positive rational integer. 

2. Preliminaries 

Let u> be an integer in a number field K and d a positive rational integer. We 
say that u> has a representation of the form x 2 + dy 2 if there are integers a and j3 in 
K such that to = a 2 + dji 2 . The representations u> = x 2 + y 2 with x = ±a, y = ±/3 
and x = ±/3, y = ±a and the representations u) = x 2 + dy 2 for d > 1 with x = ±a 
and y = ±/3 are considered to be one and the same. The relation 1 = l 2 + d ■ 2 is 
called the trivial representation of the number 1. 

3. Main Results 

These are the main results of the paper. 

Theorem 3.1. If 1 has more representations of the form x 2 + dy 2 than the trivial 
representation, then 1 has infinitely many representations. 

Proof. Assume that 

1 = 7 2 + dS 2 

where 7 and 5 are integers in K such that 7 7^ 1 and S =/= 0. 
For a positive integer n, we define 

In + S n V^d = (7 + 6s/^d) n , 

where 

n ^ ,n-2r2j ( n \ „,n-4r4 j2 



(3.1) 7« = 7" - ( 2 r-'Fd + 4 ] 7"" W - 



Key words and phrases, algebraic integer, totally real, number field. 
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and 

(3.2) s n = i '[ )r- 1 s-[ :: ] 7 "- 3 ^+-... 



n \ „_i t. n 



Then 
and 



7„ - 5 n V^d = (7 - (5v /= d) r 



(7n + ^\/ Z rf)(7« - S n V^d) = (7 + <5^)"( 7 - <5^)" - (7 2 + ^ 2 )"- 
Therefore: 

ll + d5 2 n = l. 
Thus the Diophantine equation 

(3.3) x 2 + dy 2 = 1 

has the integral solutions 

x = j n ,y = S n . 
Next, we will prove that these solutions are all different. 
Suppose for a contradiction that there are to, n € N such that to 7^ n and 

"im 7ni ^m ^n- 

Then 

( 1 + 5^d) m = ( 1 + SV^d) n , 
and so 7 + 5\/—d is a root of unity. Suppose that 

7 + 5\/^rf = c 

is a primitive JVth root of unity. Since 
we get 

Thus 

l(C 2 -i) = V^d(s 

is an algebraic integer. 

If N is a power of 2 and N > 8, then the number 

^(C Ar/4 -i) = ^(±*-i) 

must also be an algebraic integer. This is a contradiction. 
If N is divisible by the odd prime p, then the number 

\{e N/p -i) 

must also be an algebraic integer but ^(( 2N / p — 1) is the root of the irreducible 
polynomial 

-^ [(2a; + If - 1] = 2P- 1 x p- 1 + . . . + p(p - l)x + p 

Ix 

which has integral coefficients. This is a contradiction. 

Finally, if AT = 1,2,4, then 7 = or <5 is not an algebraic integer. This is a 

contradiction. □ 
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Theorem 3.2. Let K be a number field. 

(i) Let tt be a prime in K such that tt has the representation of the form 
x 2 + dy 2 . If 1 has only trivial representation, then tt has exactly one repre- 
sentation. Otherwise tt has infinitely many representations. 

(ii) Let lu be an integer in K such that to has the representation of the form 
x 2 + dy 2 . If 1 has only trivial representation, then ui has a finite number of 
the representations. Otherwise to has infinitely many representations. 

Proof, (i) Assume that 1 has only trivial representation Let 7r be a prime in K such 
that tt has two representations of the form x 2 + dy 2 , 

tt = a\ + d/3 2 

and 

tt = a 2 + df3 2 
where 011,012, Pi, 02 are integers in K. Then 

<Pl ~ 01) = o?idt - alPl 
Since tt is a prime, either of the aip2 — a 2 fa and ai/?2 + CK2P1 must be divisible by 
tt. Without loss of generality we may assume that 

ai/32 = Q2/01 mod tt. 
Multiplying together the two representations of tt, we get 

tt 2 = (aia 2 + df3i/3 2 ) 2 + d(ai(3 2 - a 2 fa) 2 - 
Since ai/?2 — o 2 fa is divisible by tt, so is the number aia 2 + d/3if3 2 . We put 

aia 2 + dPiP 2 — 7T7 and ai/?2 — ct2/3i = ttS, 
where 7, 5 are integers in K, we get 

l=j 2 + dS 2 . 
Since 1 has only trivial representation, 7 = ±1 and 5 = 0. Therefore 

aia 2 + dP\P 2 = ±7r and aij3 2 — a 2 j3i = 0. 
Then 

a 2 = -—ail and — a\ + dfiifa = ±7r, 

Pi Pi 

and so 

fa fa I 2 , j«2\ fa 2 1 jo a _l_ 

—tt= —(a 1 +dl3 1 ) = —a x + d/3i/3 2 = ±tt. 
Pi Pi Pi 

Hence /3 2 = ±/3i and a 2 = ±ai and so tt has exactly one representation. 

Suppose next that the equation (3.3) has an infinitely of solutions x = 7„,y = 5 n 

given by (3.1) and (3.2). Let tt be a prime in K such that 

tt = a 2 + dfa 2 

where a and j3 are integers in K. For a positive integer n, we define 

a„ + ftn^J-d = {in + S n V^d)(a + /3\/^d) 



where 
Thus 



ot n = aj n - df36 n and (3 n = a8 n + f3-f n . 
a n - /3 n \/ z d = (7„ - 5 n ^/^d)(a - fiV^d) 
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and 

(a„ + f3 n y^d)(a n - /3„\/ z d) = (~/ 2 + dS 2 )(a 2 + d/3 2 ) = n. 
Hence 

Tr = a 2 n + dj3 2 n . 

We will show that these are all different representations of it. 

Suppose for a contradiction that there are to, n £ N such that m ^ n and 

C^m — &n ? Pm Pn • 

Then we get 

7m Tfi5 ^m ^fi* 

But in the proof of Theorem 1 , j m — 7„ , S m — 5 n . where to ^ n leads to a 
contradiction. Therefore 7r has infinitely many representations, 
(ii) Assume that 1 has only trivial representation. Let lo be an integer in K. 
Suppose for a contradiction that lo has infinitely many representations, i.e., 

w = al + d(3l,ne N 

where a„ and j3 n are integers in if and for m =/= n, a n =/= ±a m and j3 n ^ ±fi m . 
Since OkI^Ok is finite, there are to, n € N such that m ^ n and 

(3.4) a m = a„ mod w and /3 m = /?„ mod lo. 

Multiplying the two represeentations 

lo = a m + dfi 2 ^ and lo — o? n + dfS n , 

we get 

lo 2 = (a m a n + d/3 m /3 n ) 2 + d(a m l3 n - a n /3 m ) 2 . 
It follows from (3.4) that the two numbers 

a m a n + d(3 m (3 n and a m (3 n - a„/3 m 
are divisible by lo. Hence we may put 

a m a n + d(3 m (3„ = loj and a m j3 n - a n j3 m = loS 
where 7 and 5 are integers in K. Then 

l=j 2 + dS 2 . 
Since 1 has only trivial representation, 7 = ±1 and 5 = 0. It follows that 

a m a n + dfimPn = ±w and a m (3 n - a n (3 m = 0. 
Then 

Pn _ i Pn 2 



and so 



a n = — a m and — a m + di3 m /3 n = ±w, 

Pm Pm 

P™ P™ ('2 1 jq2 \ P" 2 1 jo o 1 

Pm Pm Pm 



Hence f3 n = ±f3 m and a„ = ±a m . This is a contradiction and so the number of 
representations must be finite. □ 

Theorem 3.3. Let K be a number field and d a positive rational integer. The 
following statements are equivalent. 

(i) K = Q(V—d) or K is totally real. 

(ii) 1 has only trivial representation in K . 
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Proof. Let K be a number field of degree n. Assume that K = Q(V— d) or K is 

totally real. 

Case 1: K = Q(v / — d). Suppose that 

a 2 + d/3 2 = 1 

where a and j3 are integers in K. Then 

(a + Pv^d)(a - pV^d) = 1. 

Thus a + /3y/—d and a — f3\/—d arc units in K . For d ^ 1 , 3, the units in K are ±1 
so we have the following system 

a + f}\f— d = 1 and a — f3\/—d = 1 

or 

a + fi\J —d = — 1 and a — f3\/—d = — 1 . 

For d = 1,3, we have more cases to figure out. Nevertheless, in either cases we have 
a = ±1 and /3 = 0. Hence 1 has only trivial representation. 
Case 2: K is totally real. Suppose that 

a 2 + d/3 2 = 1 

where a and (3 are integers in K. Then the conjugate equations 

l = ( a W) 2 + d(/3«) 2 

also hold. Since the conjugates are all real, for d > 1 we get 

1 



(fc) 



< ^ <1 



for every value of fc. Thus f3 — and a = ±1. Hence 1 has only trivial representa- 
tion. 

For the converse, assume that K ^ Q(\/—d) and K is not totally real. We will 
prove that 1 has a nontrivial representation. 

Case 1: \J— d € K. Since K ^ Q(^/—d), n > 4 and so t > 1 where t is the rank of 
the unit group of K. Thus there is a unit e in K such that e is not a root of unity. 
Then the equation 

1 = a 2 + d/3 2 
is satisfied by the following numbers: 



«=\W 



') 



and 



13 



1 



i(e r ' 



2V-d 

where m is the order of the group {O k /1>\/ —dO k)* ■ Note that /3 is an integer in 
K because 

e m = 1 mod 2V^rf and e~ m = 1 mod l^f^d 

and a is an integer in K because a = y/—d(3 + e~ m . Since e is not a root of unity, 
/3 7^ 0. Hence 1 has a nontrivial representation. 

Case 2: >/— d ^ K. Let r be the number of real embeddings of K, s the number of 
nonconjugate complex embeddings of K and t = r + s — 1 the rank of the unit group 
of K. Let L = K(y/—d). Then the field L has degree 2n. Let R be the number 
of real embeddings of L, S the number of nonconjugate complex embeddings of L 
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and T = R + S — 1 the rank of the unit group of L. Since \J — d £■ R, i? = and 
5 = r + 2s and so 

T = i? + S'-l=r + 2s-l = t + s. 

Since K is not totally real, s > 1 and thus 

T>i. 

Let us consider the ring consisting of the numbers in L of the form A + p\/—d, 
where A and p are integers in K. The unit group G of this ring has the rank T. 
The subgroup G\ consisting of the squares of the units in G clearly has the same 
rank T. The units in G\ cannot all be equal to the product of a unit in K and a 
root of unity since t <T. Hence we conclude that there exists a unit E = a + by/^d 
in the ring, a and b integers in K such that o^l and b ^ 0, and such that E 2 
is not equal to the product of a unit in K and a root of unity. Then the number 
Ei = a — b\J —d is also a unit in L. Hence a 2 + db 2 is a unit in K. Then the number 



is satisfied by the following numbers 





1 


= a 2 


+ d/3 2 


in 


ibers 








a = 


rp2m _i_ rp2m 




2(a 2 


+ db 2 ) m 


P 




Tp2m Tp2m 


2\ 


/=d( 


a 2 +db 2 ) 



and 



where m E N. Since a 2 + db 2 is a unit in K, a and j3 are integers in K. If j3 = 0, 
then E 2m — E 2m . Hence EE^ 1 must be a root of unity and 

E 2 = {a 2 + db 2 ){EE^) 

is a product of units and a root of unity. This is a contradiction. Thus /3 ^ and 
so 1 has a nontrivial representation. □ 
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ON THE MINIMUM RANK AMONG POSITIVE SEMIDEFINITE 
MATRICES AND TREE SIZE OF A GIVEN GRAPH OF AT MOST 

SEVEN VERTICES 

XINYUN ZHU 

Abstract. In this paper, we study the minimum rank among positive semidefinite 
matrices with a given graph of at most seven vertices (msr) and we list msr for all simple 
graphs with at most seven vertices which were listed in [1]. For a simple connected 
graph with no pendent vertices, we study how large ma.xc^g n (msr(G) — ts(G) + 1) 
would be, where ts(G) (see Definition 1.1) denote the tree size of a graph G 



Key words, rank, positive semidefinite, graph of a matrix 

AMS subject classfication: 15A18,15A57,05C50 

1. Introduction 

Given a connected graph G with n vertices, we associate to G a set %{G) of Hermitian 
n x n matrices by the following way, 

H(G) = {A\A = A*, ctij 7^ for i ^ j if and only if (i,j) is an edge of G} 

where A* is the complex conjugate of A and a^ is the ij -entry of A. We define V(G) 
be the subset of %{G) whose members are positive semidefinite matrices. For a given 
graph G, V(G) is non-empty because laplacian matrix L(G) G V(G). Define 

msr(G) = min rank(A) 

AeP(G) 

There are two main results of this paper which were given in Section 3 and Section 4 
separately. 

A complete list for msr(G) for all the simple graphs with less than seven vertices 
which were listed in [1] has been given [2]. In this paper, we calculate msr(G) for all the 
simple graphs with at most seven vertices which were listed in [1], see Section 3. This is 
one main result of this paper. 

To introduce anther main result of this paper, we need the following preparation. 

Definition 1.1. An induced subgraph H of a graph G is obtained by deleting all vertices 
except for the vertices in a subset S. For a graph G, we consider its tree size, denoted 
ts(G), which is the number of vertices in a maximum induced tree. 

Lemma 1.1. [2, Lemma 2.1] If H is an induced subgraph of a connected graph G, then 
msr(H) < msr(G). 

Given a simple connected graph G, we can define the tree size ts(G) of G. It is known 
that for a tree T, msr(T) = ts(T) — 1. Hence by Lemma 1.1, we get ts(G) — 1 < msr(G). 
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Lemma 1.2. [2, Corollary 3.5] If a simple connected graph G has a pendant vertex v, 
which is simply vertex of degree 1, then msr(G) = msr{G — v) + 1. 

Let Q n denote the set of all the simple connected graphs with n vertices and with no 
pendent vertices. We will study max GG g n {msr{G) —ts(G) + 1). We answer this question 
for all the members in Q n with n < 7 in Theorem 4.1. By giving a counter-example in 
Section 5, we show that Theorem 4.1 is not true if the vertices number is bigger than 7. 
For a simple connected graph with no pendent vertices, we ask under what conditions, 
Theorem 4.1 is still true in Section 5. 

2. Finding msr by giving vector representations 

Suppose G is a connected graph with vertex set V(G) = {v i, t> 2 , • • • , v n }. We call a set 
of vectors V = {v~i, • • • , if n } in C m a vector representation (or orthogonal representation) 
of G if for % 7^ j G V(G), (vi,Vj) ^ whenever % and j are adjacent in G and (vi,Vj) = 
whenever % and j are not adjacent in G. For any matrix B, A = B* B is a semi-definite 
positive matrix and rank(A) = rank(5), where B* denotes the conjugate transpose of 
B. For any semi-definite positive matrix A, there exists a matrix B such that A = B*B 
and rank(A) = rank(5). 

For a large family of graphs, [2] has given a good way to get msr. But [2] did not solve 
the msr problem completely. In this section we solve the msr for graphs with at most 
seven vertices but does not satisfies the conditions of graphs in [2] . We refer the reader 
to [1] to check the graphs. For the readers convenience, we list all these graphs here 
G706, G710, G817, G836, G864, G867, G870, G872, G876, G877, G946, G954, G955 
G979, G982, G992, G997 - G1000, G1003 - G1007, G1053, G1056, G1060, G1065 
G1069, G1084, G1089 - G1097, G1100, G1101, 1104, G1105, G1123, G1125, G1135 
G1145, G1146, G1148, G1149, G1152 - G1157, G1159, G1160, G1165, G1167, G1168 
G1170, G1176, G1179, G1189, G1191, G1194 - G1197 G1199, G1200, G1202, G1205 
G1207 - G1212, G1222, G1224, G1228, G1230, G1231, G1233, G1241, G1242, G1248 
G1250. We have given the msr of all those graphs by finding the vector representations 
in [3]. To illustrate how this works, we calculate msr (G946) and msr (G1104) here. We 
refer the reader to [3] to get the msr of other graphs mentioned above. 

In the following examples, for vectors Vi and Vj, by abusing notations, we write 
(vi,Vj) = Vi ■ Vj. If Vi ■ Vj = 0, then we write V{ _L Vj. Let G be a graph. Let V(G) 
be the vertices set of G and E(G) the edges set of G. Define 

N(a) = {b\b G V(G), b^a,abe E(G)}. 

Example 2.1. Let V(G946) = {1,2, 3, 4, 5, 6, 7} and let £(G946) be defined by N(l) = 
{2, 6}, N(2) = {1,3, 6, 7}, 7V(3) = {2, 4, 5, 7}, N(4) = {3, 5}, N(5) = {3, 4, 6, 7}, N(6) = 
{1, 2, 5, 7}, N(7) = {2, 3, 5, 6}. We claim 

msr(G94Q) = 4. 

The tree size of G946 is 3. First let's show that msr(G9A6) > 3. 

Let {v i, V2, t>3, t>4, t>5, vq, v^} be a vector representation of G946. We have v± _L {t> 3 , t> 4 , t> 5 , t> 7 }, 
V2 J- {^4,^5}, V3 _L {vq}, V4 _L {vq,v 7 }. If msr(G946) = 3, then t> 3 = av^ + bv^ for some 
nonzero a and b. Hence = av^ ■ v$ + bv-j ■ Vq = bv-j ■ Vq. This is a contradiction. 
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ON THE MSR AND TS OF A GIVEN GRAPH 
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Let 



B 



110 0-10 
111 
110 0-11 
10 1 1 2 



Then B T B = A, A e P(G946) and mnk(B) = 4. 

Example 2.2. Let V(C71004) = {1,2,3,4,5,6,7} and let £(G1004) be defined by 
N(l) = {2, 6, 7}, JV(2) = {1, 3, 6, 7}, JV(3) = {2, 4, 5}, N(4) = {3, 6, 7}, N(5) = {3, 6, 7}, 
N(6) = {1,2,4,5}, N(7) = {1,2,4,5}. We claim 



msr(G1004) 



The tree size of G1004 is 5. 
Let 



B 



3 
-1 



110 1 
110 1 
2 1-1 1 
110 1 -1 



Let 



A = B L B 



Then A e V(G100A) and rank(A) = 4. 



3. LIST 

In this section, we list the msr for all the simple connected graphs with seven vertices. 
To get such a list, we use some results from [2], for example, [2, Proposition 2.2], [2, 
Theorem 2.9], [2, Theorem 3.6], and the vector representation method in Section 2 and 
[3] . We would like to remind the readers that a complete list of msr of simple connected 
graphs with less than seven vertices has been given in [2]. 
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msr(G)=6 


G270-G274, G276,G278-G280,G284, G286 


msr(G)=5 


G314-G322,G324-G329, G331-G334, G336-G344, 
G348-G351, G353,G379-G386, G390-G394,G396, G398-G403, 

G405-G406, G408, G410-G416, G421-G423, G427-G428, 

G431-G435, G437-G442, G445-G446, G448-G449, G476-G484, 

G488-G489, G494, G497, G504, G506-G509, G512-G513, 

G515-G520, G522-G530, G532-G533, G540-G541, G548, 

G555-G556, G558-G561, G563-G566, G571, G574-G575, 

G579,G612-G618, G621-G623, G627-G628, G632, G638, 

G640, G646, G651, G669-G670, G673-G674, G676-G680, 

G682-G685, G690, G694, G698, G702,G706, G710,G790-G792, 

G794-G795, G797, G804, G806, G813, G840, G897 


msr(G)=4 


G388-G389, G395, G404, G409, G419-G420, G424-G426, 

G429-G430, G436, G443-G444, G447,G450,G473-G475,G485-G487, 

G490-G493, G495, G498-G501,G503, G505, G510-G511, G514, 

G521, G531, G534-G539,G542-G547, G549-G550, G552-G554, 

G562, G567-G570,G572-G573, G576-G578, G580-G581,G598-G610, 

G620,G624-G626, G629, G631, G633-G637, G639,G641-G643, 

G645, G647-G649, G652-G668, G671-G672, G675, G681,G688, 

G691-G693, G695-G697, G699-G701, G703-G705,G707-G709, 

G711-G724, G727-G730, G740-G744, G746-G747,G748-G780, 

G782-G786, G788-G789, G793, G798-G803, G805,G807-G808, 

G810-G812, G816-G828, G830-G839,G841-G850, G852-G855, 

G857-G862, G864-G872,G874,G876-G877, G886-G892,G898-G900, 

G902-G905,G907-G910, G913-G924, G926-G944,G946, G948, 

G950-G952,G954-G955, G957,G959-G969, G971-G982, G985-G987, 

G989,G992-G994,G997-G999,G1003-G1004,G1006-G1008,G1014-G1017, 

G1021,G1025-G1031, G1033-G1034, G1037, G1039, G1041-G1055, 

G1057-G1061, G1065, G1069,G1078-G1080, G1082-G1085, G1088-G1089, 

G1091-G1093, G1097,G1101,G1116-G1121,G1127-G1128, G1139-G1143, 

G1145,G1148,G1150,G1154,G1184-G1185, G1187, G1190. 


msr(G)=3 


G551, G557, G619, G630, G644, G650, G686-G687,G689 

G725-G726, G781, G787, G796, G809, G814-G815,G829,G851,G856 

G863, G873, G875,G878,G885, G893-G896,G901,G906,G911-G912, 

G925, G945, G947,G949, G953, G956,G958, G970, G983-G984,G988, 

G990-G991, G995-G996,G1000-G1002, G1005,G1012-G1013,G1018-G1020, 

G1022-G1024,G1035-G1036, G1038, G1040,G1056,G1060,G1062-G1064, 

G1066-G1068, G1070-G1077, G1081,G1086-G1087,G1090, G1094-G1096, 

G1098-G1100,G1102-G1106, G1108-G1115,G1122-G1126,G1129-G1138, 

G1144,G1146-G1147,G1149, G1151-G1153,G1155-G1159,G1161-G1163, 

G1165-G1171,G1173-G1183,G1186,G1189, G1191-G1210,G1212,G1214, 

G1215, G1217-G1225,G1227-G1229, G1231-G1233,G1235-G1236, 

G1239-G1241,G1245-G1246. 


msr(G)=2 


G1009, G1032,G1164, G1188,G1211, G1213, G1216, G1226,G1230, 
G1234,G1237-G1238, G1242-G1244, G1247-G1251. 


msr(G) = l 


G1252 
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4. Theorem 

By observing the results in Section 3, we get the following theorem. 

Theorem 4.1. Let G be a simple connected graph without pendant vertices. If\V(G)\ < 
7 and the tree size of G is k, then msr(G) is either k or k — 1. 

Proof. Define 

type = (ts, msr) 
We have the following, 

type = (6, 5) 

G446, G448-G449, G555-G556, G558-G559, G669-G670, G673-G674, G676-G678, 
G682 - G685, G790, G792, G794 - G795, G813, 

type = (5, 5) 

G445, G571, G574-G575, G579, G679 - G680, G690, G694, G698, G702, G706, G710, 
G791, G797, G804, G806, G840, 

type = (5, 4) 

G105, G127-G129, G145-G146, G148, G161-G162, G164, G443-G444, G447, G450, 
G553-G554, G570, G572, G576-G578, G580-G581, G668, G672, G675, G681, G688, 
G691 - G693, G695 - G697, G699 - G701, G703, G705, G707 - G709, G711 - G724, 
G727-G730, G793, G798-G801, G803, G805, G807-G808, G810-G812, G816-G828, 
G830 - G839, G841 - G850, G852 - G855, G857 - G862, G864 - G870, G872, G874, 
G876, G913 - G914, G918 - G924, G926 - G930, G932 - G934, G936, G941 - G944, 
G950 - G952, G957, G959 - G963, G965 - G966, G969, G973 - G975, G977 - G982, 
G985, G987, G989, G992-G994, G997, G999 G1007-G1008, G1025-G1026, G1030- 
G1031, G1042-G1045, G1047-G1048, G1050-G1055, G1057, G1061, G1080, G1085, 
G1088 - G1089, G1093, G1116, G1118, G1121, G1139 - G1140, G1142, G1148, G1160, 
G1184 - G1185, G1187, 

type = (4, 4) 

GU7, G149, G152, G167, G552, G573, G671, G704, G802, G871, G877, G915 - G917, 
G931, G935, G937-G940, G946, G948, G954-G955, G964, G967-G968, G971-G972, 
G976, G986, G998, G1003 - G1004, G1006, G1027 - G1029, G1033 - G1034, G1037, 
G1039, G1041, G1046, G1049, G1058-G1059, G1065, G1069, G1078-G1079, G1082- 
G1084, G1091-G1092, G1097, G1101, G1117, G1119-G1120, G1127-G1128, G1141, 
G1143, G1145, G1150, G1154, G1190, 

type = (4, 3) 

G38, G43-G44, G46-G47, G126, G130, G144, G150-G151, G153-G154, G163, G166, 
G168 - G175, G179 - G189, G192 - G193, G196 - G198, G201 - G202, G551, G557, 
G686-G687, G689, G725-G726, G809, G814-G815, G829, G851, G856, G863, G873, 
G875, G878, G925, G945, G947, G949, G953, G956, G958, G970, G983 - G984, G988, 
G990 - G991, G995 - G996, G1000 - G1001, G1005, G1035 - G1036, G1038, G1040, 
G1056, G1060, G1062-G1064, G1066-G1068, G1070-G1077, G1081, G1086-G1087, 
G1090, G1094-G1096, G1098-G1100, G1102-G1106, G1113-G1115, G1122-G1126, 
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G1129-G1138, G1146-G1147, G1149, G1151-G1153, G1155-G1159, G1161-G1163, 
G1165-G1171, G1175-G1183, G1186, G1189, G1191-G1210, G1212, G1214-G1215, 

G 1217 - G1221, G1227 - G1229, G1232 - G1233, G1240 - G1241, G1245, 

type = (3, 3) 

G796, G1144, G1222, G1224 - G1225, G1231, G1235 - G1236, G1239, G1246, 

type =(3,2) 

G16 - G17, G42, G48 - G51, G165, G190, G194 - G195, G199 - G200, G203 - G207, 
G1009, G1032, G1164, G1188, G1211, G1216, G1226, G1230, G1234, G1237 - G1238, 
G1242 - G1244, G1247 - G1251, 

type = (2, 1) 

G7, G18, G52, G208, G1252, □ 

5. Question 

Example 5.1. Let G be a simple connected graph with 12 vertices V(G) = {1, 2, • • • , 12} 
and the edge set is defined by _V(1) = {2,3}, N(2) = {1,3,4}, JV(3 = {1,2,4,7}, 
N{4) = {2,3,5,7,8}, N{5) = {4,6,8,9}, 7V(6) = {5,9}, N(7) = {3,4,8,10,11}, 
N(8) = {4,5,7,9,10}, N(?) = {5,6,8}, iV(10) = {7,8,11,12}, JV(ll) = {7,10,12}, 
iV(12) = {10, 11}. Then G is a chord graph. Then the tree size of G is 8 and msr(G) = 9. 
Hence Theorem 4.1 is false in this case. 

Question 5.1. Let Q n denote the set of all the simple connected graphs with n vertices 
and with no pendent vertices. Suppose n > 7. Under what conditions is the Theorem \.l 
true? 

Example 5.2. Let K m ^ n be the complete bipartite graph on m and n vertices. The 
tree size of K myTl is max{m,n}. It has been proved that msr(K mtn ) = max{m,n}. 
Theorem 4.1 is true in this case. 

Example 5.3. Let C be a circle with n vertices. The the tree size of G is n — 1 and 

msr(C) = n — 2. Theorem 4.1 is true in this case. 

Remark 5.1. [2, Theorem 2.9] gives a sufficient condition for a connected graph G such 
that msr(G) = ts(G) — 1. However not all of members in Qj satisfy the conditions in [2, 
Theorem 2.9]. 
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ON THE CONSTRUCTION OF NUMBER SEQUENCE IDENTITIES 

WUN-SENG CHOU AND TIAN-XIAO HE 

Abstract, to construct a class of identities for number sequences generated by linear recur- 
rence relations. An alternative method based on the generating functions of the sequences is 
given. The equivalence between two methods for linear recurring sequences are also shown. 
However, the second method is not limited to the linear recurring sequences, which can be 
used for a wide class of sequences possessing rational generating functions. As examples, 
Many new and known identities of Stirling numbers of the second kind, Pell numbers, Jacob- 
sthal numbers, etc., are constructed by using our approach. Finally, we discuss the hyperbolic 
expression of the identities of linear recurring sequences. 



1. Introduction 

Many number and polynomial sequences can be denned, characterized, evaluated, and clas- 
sified by linear recurrence relations with certain orders. A number sequence {a n } n >o is called 
sequence of order r if it satisfies a linear recurrence relation of order r 



&J 



a n -j, n>r, (1.1) 



for some constants pj (j = 1,2, ... ,r), p r ^ 0, and initial conditions a,j (j = 0, 1, . . . , r — 
1). Linear recurrence relations with constant coefficients are important in subjects including 
pseudo-random number generation, circuit design, and cryptography, and they have been 
studied extensively. To construct an explicit formula of the general term of a number sequence 
of order r, one may use generating function, characteristic equation, or a matrix method (See 
Comtet [6], Hsu [12], Strang [16], Wilf [17], etc.). In [10], He and Shiue presented a method 
for the sequences of order 2 using the reduction of order, which can be considered as a class of 
how to make difficult an easy thing. In next section, the method shown in [10] will be modified 
to give a unified approach to construct a class of identities of linear recurring sequences with 
any orders. An alternative method will be given in Section 3 by using the generating functions 
of the recursive sequences discussed in Section 2. The equivalence between these two methods 
for linear recurring sequences will be shown. However, the second method can be applied for 
all the sequences with rational generating functions. Inspired by Askey's and Ismail's works 
shown in [1], [4], and [13], respectively, we discuss the hyperbolic expression of the identities 
constructed by using our approach, which and another extension will be presented in Section 
4. 



This work is partially supported by the National Science Council in Taiwan under the grant number 100- 
2115-M-001-013. The author Wun-Seng Chou would like to thank the Department of Mathematics, Illinois 
Wesleyan University, for its hospitality. 
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2. Identities of linear recurring sequences 

We now modify the method applied in [10] and extend it to the higher order setting. More 
precisely, we will give a unify approach to derive identities of linear recurring sequences of 
arbitrary order r. The key idea can be described in the following theorem. 

Theorem 2.1. Let sequence {a n } n >o be defined by the linear recurrence relation (1.1) of 
order r, and let its characteristic polynomial P r (t) = t r — Yl^iPj^ -3 have r roots otj (j = 

1,2,..., r), where the root set may be multiset. Denote af[ '■= On a j-i a n-i ft — 3 — r ) 

and an := a n . Then 

a M =a n-+i a M i; ( 2.i) 

where 

r-l 

a « r) = a n - a n -i^2 on + a n - 2 ^ a^a? 

i=l l<i<jr<r— 1 

-a n _ 3 ^2 aiOjCXk H h (-l) r-1 o n _ r .+iII^r 1 1 ai (2.2) 

l<i<j<k<r-l 



for n > r — 1. 

Remark 2.1 an shown in (2.2) can be written as 

r-l 

°n r) = 5^( -1 ) n On-* Yl a kl ---a ki . 

i=0 l<fci<-<fci<r-l 

(2) 

Proof. Denote a„ := a n — a,\a n -i- Then the recurrence relation (1.1) can be reduced to 

r 
(21 (2) V^ (2) V~^ (2) V^ 

a n ' = a n _ l 2_^a k - a n _ 2 ^ a i a i + a n-z 2^ a i a j a k 

fc=2 2<i<j<r 2<i<j<k<r 

+(-l) r a { nl +i n r k=2 a k , (2.3) 

a linear recurrence relation of order r — 1 for sequence {a n } n >o- Similarly, we denote a n := 
a n — a2«n ■ Hence, from (2.3), we obtain 

r 
(3) (3) V^ (3) V~^ i ( 3 ) V^ 

a n ' = a n _ l l^a k - a n _ 2 ^ a i a j + a n-3 2^ a i a j a k 

fc=3 3<i<j<r 3<i<j<k<r 

+(-i) r - 1 4 3 2 r+2 nL 3 « fe . 

The above expression is a linear recurrence relation of order r — 2 for sequence {a n } n >o- 
Repeating this process and denoting a„ := a„ — a r _ia„_ 1 , we finally obtain 

«l r) = «rai r 2 1; (2.4) 
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which implies (2.1). In (2.1), for n > i — 1, 

(r) _ (r-l) _ O'-l) 

a n ~ a n a r-l a n -l 

(r-2) < , \ (r-2) . (r-2) 

= a K n ' - [a.r-i + a r -2)a y n _ 1 + av_iav-2<_ 2 

r-l 
( r _ 3 ) (r-3) V^ . (r-3) V^ (r-3) 

= a n a n-l 2^ a * + a n-2 2^ "^ _ °ri-3 a r-^Oi r - 2 a r -i, 

i=r—3 r—3<i<j<r—l 

which yields (2.2) by using mathematical induction. □ 

As an example, for r = 2, if the characteristic polynomial P-2,(t) = t 2 — pit — p 2 of (1.1) has 

(2) 

roots a\ and a 2 , then we denote ah. '■= o, n — aia n -i and obtain 

af£' ■= a n - aia n -i = a 2 (a n _i - aia n _ 2 ) = a2a n '_ 1 = cJpM ■ 

Similarly, for r = 3, we denote the roots of the characteristic polynomial Ps(i) = t 3 — pit 2 — 
Pit - p 3 of (1.1) by otj (j = 1, 2, 3). Then, 



4 2) : = «n - «l a n-l = ("2 + a3)On-l - «l(02 + 03)^71-2 ~ 0203(a„-2 ~ OlOn-3) 

- ( 

which implies 



( , \ (2) (2) 

(a 2 + 03)«n_l - a2«3<_ 2 ) 



(3) (2) (2) / (2) (2) x (3) n ~2 (3) 

Remark 2.2 If ay = 1, then (2.1) becomes 

<4 r) =<£>!, (2.5) 

where aj, r is shown in (2.2). In particular, for r = 2, we have 



a® - a (2) 



or equivalently, 



a n = aia n _i + ai — a±ao. 

Thus, we have shown that the last non- homogenous recurrence relation of order 1 is equivalent 
to the homogeneous recurrence relation of order 2, a n = (ai + l)a n -i — aia„_ 2 , for the same 
sequence {a ra } n >o- Similarly, we have the equivalence of the homogenous recurrence relation of 
order 3, a n = (p + l)a n _i — (p — q)a n -2 + qa n -3, and the non-homogenous recurrence relation 
of order 2, a n = pa n -\ + qa n _ 2 + d for uniquely determined constant d = a 2 — pai — gao- 

We now consider three special cases r = 2, 3, and 4 for some particular cases of Theorem 
2.1. 

Corollary 2.2. Lei {a n } n >o be a sequence satisfying the linear recurrence relation of order 2: 

an=Pia n -i+P2a n -2, n>2, 

with initial conditions ao and a\, and let the characteristic polynomial -P 2 (i) = t 2 — pit — p 2 
have roots a and (3. Then the sequence {a n } n >o satisfies the identity 

a n = P a l 1 
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where a>n = a n — aa n -\ for n > 1 

As an example, we consider Pell number sequence {P n }n>o generated by the recurrence 
relation 

P n = 2P n -\ + -P n -2 

with initial conditions Pq = and Pi = 1 . The roots of the characteristic polynomial t 2 — It — 1 
are a = 1 + y2 and j3 = 1 — v2. Hence, Corollary 2.2 gives identity for n > 1: 

P n - (1 + >/2)P„-i = (1 - v^)™" 1 , 
or equivalently, 

(1 - ^)P„ + P n _i = (1 - v 7 ^)". 
Similarly, we have 

(1 + y/2)P n + P n _i = (1 + V2) n 

for n > 1. 

Jacobsthal number sequence { J n }n>o is generated by 

'Jn — •Jn~\ ~r ^J-n—2 

with initial conditions Jo = and J± = 1. The characteristic polynomial t 2 — t — 2 has two 
roots a = 2 and (3 = —1. Hence, from Corollary 2.2, we obtain 

J„ — 2J„_i = (— l) n 

and 

T I t on— 1 

<^n ~r ^ro— 1 — -^ 

For Fibonacci number sequence {-F n }n>o and Lucas number sequence {L n } n >o, we may use 
the same argument shown above to construct the well-known identities as follows (see also [14] 
and [10]) . 

2 ^ + ^™-i- I 2 J ' 

2 ^ + ^™-i- I 2 I ' 

— - — L n - L n - X = V$ I — g — ) ' 

Corollary 2.3. Let {o n } n >o be a sequence satisfying the linear recurrence relation of order 3: 

a n = pia n -i + P2a n -2 + P3a n ~3, n>3, 

with initial conditions ao a\, and ai, and let the characteristic polynomial Ps(t) = t 3 — pit 2 — 
P2t — P3 have roots a, (3, and 7. Then the sequence {a„} n >o satisfies the identity 

a (3) _ n-2 (3) 
a n — 1 u 2 1 

(3) 

where a n = a n — (a + f3)a n -i + a(3a n -2 for n > 2. 
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Corollary 2.4. Let {a ra } n >o be a sequence satisfying the linear recurrence relation of order 4: 

a n = pia n _i + p 2 a n -2 + P3«n-3 + P4d n -4, n > 4, 

with initial conditions ao a±, a 2 , and a^, and let the characteristic polynomial P±(t) = t 4 — 
Pit 3 — p 2 t 2 — Pit— p^ have roots a, (3, j, and 5. Then the sequence {a n }n>o satisfies the identity 

a n = o a 2 , 

where a„ = a n - (a + (3 + 7)a n _i + (a/3 + 07 + (3^)a n - 2 + a/?70n-3 for n > 3. 

Examples related to some famous linear recurring sequences in combinatorics are presented 
below for the applications of Corollaries 2.2, 2.3, and 2.4. 

Example 1. We now construct identities for sequences shown in Table 6 of [9] (see also 
in http://www.research.att.com/ njas/sequences/). Sequence A001047, a n = 3™ — 2 n = 

2 < > + < > satisfies recurrence relation a n = 5a n _i — 6a„_2> where < , 

denote Stirling numbers of the second kind. Thus, from Corollary 2.2, we have 

a n = 2a n _i + 3 n_1 , and a n = 3a„_i + 2 n_1 , 

which implies the following identities of Stirling numbers of the second kind: 

2 { n n+{ B rHhH{2H"" 

The above identities imply 2 < > + < > = 3 ra — 2 n , but the converse implication 

is not obvious. 

Similarly, for Sequence A003462, a n = (3™ — l)/2 satisfying a n = 4a n _i — 3a n _2, there hold 

n + llfn+ll j n \ f n 1 _ n _i 

3 ] + l 2 ] [ 3 j I2}- 3 • 

Mersenne number sequence a n = 2 n — 1 (AO 00225) satisfying a n = 3a ra _i — 2a n _2 generates 

a n = a n _i + 2 n ~ 1 , and a n = 2a„_i + 1, 



or equivalently, 



n + 1 { J n 

2 ;-\ 2 

n + l I f n 



2 n-i 



2 2 



due to a n = ^ 9 
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We now consider examples of sequences of order 3. If the sequence {a n } ra >o satisfies a linear 
recurrence relation of order 3: 

a n = 3ka n ^! - (3k 2 - l)a n _ 2 + k(k 2 - l)a n _ 3 

for some positive integer k, then solutions of the equation £ 3 — 3kt 2 + (3k 2 — l)t — k(k 2 — 1) = 
are k ± 1 and k. Thus, Corollary 2.3 shows that the sequence satisfies identities 

a n - 2ka n - l + (k 2 - l)a n _ 2 = k n ' 2 (a 2 - 2k ai + (k 2 - l)a ), (k > 0), 

(2.6) 
a n - (2k + l)a n _i + k(k + l)a n _ 2 

= (k- l) n ~ 2 (a 2 - (2k + l)ai + fc(fc + l)a ), (fc > 1), (2.7) 

a n - (2k - l)a„-i + k(k - l)a n -2 

= (k + l) n ~ 2 (a 2 - (2k - l)oi + fc(A - l)ao), (fc > 1). (2.8) 

In particular, if ao = ai = 1 and a 2 = 2, then (2.6)-(2.8) can be written as 

a n - 2ka n -! + (k 2 - l)a n _ 2 = (k - \) 2 k n ~ 2 \ (k > 0), (2.9) 

a n - (2k + l)a„_i + fc(fc + l)a n _ 2 = (k 2 - k + l)(fc - l) n " 2 , (fc > 1), 

(2.10) 

(2.11) 



a n - (2k - l)a„_i + fe(fc - l)a n _ 2 = (k z - 3k + 3)(k + l) n ~ z , (k > 1), 



respectively. 

Example 2. Sequence A129652, {a n } n >o = {1,1,2,7,26,91,...}, is defined by the linear 
recurrence relation of order 3: 

a n = 6a n _i - lla n _ 2 + 6a n _ 3 

with initial conditions ao = «i = 1 and a 2 = 2. It is easy to see the three roots of the 
characteristic polynomial equation t 3 — 6t 2 + lit — 6 = are 1, 2, and 3. Thus, using (2.9)- 
(2.11) for k = 2, we obtain identities 

a n - 4a„_i + 3a„_ 2 = 2 n ~ 2 , 
a n - 5a„_i + 6a„_ 2 = 3, 
a n — 3a n -\ + 2a n -2 = 3 n , 

respectively. Let P(A) be the power set of an n-element set A. Then a n -\ is the number of 
pairs of elements x,y of P(A) for which either (1) x and y are disjoint and for which x is not 
a subset of y and y is not a subset of x, or (2) x and y are intersecting and for which either 
x is a proper subset of y or y is a proper subset of x, or (3) x = y. Hence, the general term 

f n + 1 ] 

of {a n }„>o = {1,1,2,7,26,91,...} is a n = < > + 1 (See Have [9]). From the above 
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identities, we obtain identities of Stirling numbers of the second kind as follows: 

n-2 



3 J I 3 J L 3 

B j 1 }- s {;}+4 B ; 1 ^ s 



Sequence A162723, {a n } n > = {1, 1, 2, 16, 116, 676, . . .} is denned by a n = 9a n _i -26a n _ 2 + 
24a n _3 with initial conditions ao = ai = 1 and a 2 = 2. Its characteristic polynomial p(t) = 
i 3 + 9t 2 - 26t + 24 has roots 2, 3, and 4. Thus, we apply (2.9)-(2.11) for k = 3 to the sequence 
and obtain 

a n - 6a„_i + 8a n -2 = 4 • 3 n ~ 2 , 
a n — 7a n -\ + 12a n _2 = 7-2 , 
a n - 5a„_i + 6a n _ 2 = 3 • 4 n ~ 2 , 

respectively. 

If a sequence {a n } n >o satisfies the linear recurrence relation of order 3: 



2(k + l)a„_i - (/c 2 + 3k + l)a n _ 2 + fc(A: + l)a r . 



for some positive integer k > 1, then roots of the characteristic polynomial ^(i) = t 3 — 2(fc + 
l)t 2 + (k 2 + 3fc + l)t — fe(fc + 1) are 1, k and A; + 1. Thus, Corollary 2.3 shows that the sequence 
satisfies identities 

a n - (2k + l)a„_i + k(k + l)a„_ 2 = a 2 - (2fc + l)oi + fc(fc + l)a , (2.12) 

a n - (k + 2)a n _i + (k + l)a n _ 2 = A; n_2 (a 2 - (k + 2)oi + (fc + l)a ), (2.13) 

a n - (k + l)a n _i + ka n ^ 2 = (k + l) n_2 (a 2 - (fc + l)ai + fca ). (2.14) 

In particular, if ao = «i = 1 and a 2 = 2, then (2.12)-(2.14) can be written as 

a n - (2k + l)a n _i + fc(fc + l)a n _ 2 = fe 2 - k + 1, (2.15) 

a n - (k + 2)a n -i + (k + l)a n _ 2 = fc™" 2 , (2.16) 

a n -(k + l)a n _i + fca n _ 2 = (fc + l)"" 2 , (2.17) 

respectively. 

Example 3. Consider Sequence A000325, {a n } n > = {1,1,2,5,12,27,58,...}, which is de- 
fined by a n = 2 n — n and satisfies the recurrence relation 

a n = 4a n _i - 5a n _ 2 + 2a„_ 3 , n > 3, 

with initial conditions ao = a\ = 1 and a 2 = 2. DeSario and Wenstrom [8] have shown that 
a n is the number of different permutations of a deck of n cards that can be produced by a 
single shuffle. From Lascoux and Schutzenberger [15], one may see that a n is also the number 
of permutations of degree n with at most one fall, called Grassmannian permutations. Since 
the corresponding characteristic polynomial equation t 3 — 4t 3 + 5£ 2 — 2 = has solutions 1, 
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1, and 2, we may use (2.16)-(2.17) (Note (2.15) and (2.16) are identical for k = 1) and obtain 
identities 

a n — 3a n _i + 2a„_2 = 1, 

a n — 2a n _i + a n _2 = 2 , 

respectively. 

For k = 2, we obtain Sequence A129652, {a n } n > = {1, 1, 2, 7, 26, 91, . . .} from (2.15)-(2.17). 
This sequence and its three identities have been presented in Example 2. Similarly, if fc = 3, 
we get Sequence A162725, {a ra } n >o = {1, 1, 2, 9, 46, 221, . . .}, which is defined by 

a n = 8a n _i - 19a n __ 2 + 12o n _ 2 . 
Hence, there hold 

a n — 7a n -\ + 12a n _2 = 7, 
a n - 5a n -\ + 4a„_2 = 3 n ~ 2 , 
a n - 4a n _i + 3a„_ 2 = 4 n_2 . 

3. An alternative method using the generating functions 

Let {a n } n >o be the linear recurring sequence defined by (1.1). Then its generating function 
P{t) can be written as 

r— 1 / n \ r 

Pit) = {a + Y, «" - E^' a ^' * n }/< 1 - E^' tJ }- ( 3 - X ) 

n=l \ j=l / j = l 

Hence, we have the following result. 

Proposition 3.1. Let the characteristic polynomial of the linear recurring sequence {a ra } n >o 
defined by (1.1) be p{t) = n[ =1 (£ — ojj). Then the denominator of the generating function P(t) 
of {a n } n >o equals n[ =1 (l - ait). 

The proof is straightforward and omitted. Based on this fact, we may give the following 
method, which is an alternative method of that presented in Section 2. The equivalence of 
two methods will be shown later. 

Proposition 3.2. Let {a n } n >o be a sequence with the generating function P(t) = A(t)/B(t), 
in which A(t) can be expressed as a formal power series and B(t) is a non-null polynomial. 
Suppose that B(t) can be factored as B(t) = qi(t)q2(t) with 



then 

implies 

or equivalently, 



qi{t) = go + git + §2t 2 -\ h g r t r , 
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A(i) 

goa n + gia n -i H V g r a n - r = [t n ]—-x- (3.2) 

Q2{t) 

Observe that no formal proof is necessary, since everything depends on the "rules of the 
generating function" and the "coefficient of operators" , which are immediate. If we are able 
to extract the coefficient of t n from A(t)/q2(t) then we have obtained a (non homogeneous) 
recurrence relation of order r by using Proposition 3.2. 

First, let {a ra } n >o be the linear recurring sequence defined by (1.1) and let Pit) be the 
generating function. From equation (3.1) and Proposition 3.1, we have 

oo + E[=i ( a i ~ £}=i Pj a i-j) t* 

m = n?=i(i -«it) ' 

and so 

a + EI=i ( a i ~ Y!j=\ Pj a i-j) t% 

p<mz{{\ - cat) = \_ ; 3) . (3.3) 



Multiplying out the left hand side of (3.3), we have 



r-l 



nP^ir:-^! - out) = a n - a n _! Y,^ + ■■■ + (-l) r - 1 o n _ r+ iDj- 1 1 a i = a£> (3.4) 



i=l 



for all n > r — 1. Multiplying out the right hand side of (3.3), we have 



r— 1 / i 

aoa™ + ^ a, - ^Pj( 

r— 1 r— 1 i 

i=0 

r-l 




= <- r+1 4-i (3-5) 

with the convention that (— l) r-1_l £ 1<fc <...<& <r Ofci " " " Ofcr-i-i = 1 for i = r — 1. From 

(3.3), (3.4) and (??), we have an = a™~ r+1 a),_ 1 for all n> r, which is the same as the result 
in Theorem 2.1. So, the method in Proposition 3.2 is indeed an alternative method of that 
presented in Section 2 for linear recurring sequences. 

Let P(t) = A(t)/B(t) be the generating function of sequence {a n }„>o, in which B(t) is 
a non-null polynomial of degree r. Then, B{t) can be factoring into a product of r linear 
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factors defined in C, which implies that 2 r identities can be constructed by using the method 
shown in Proposition 3.2. In other words, there are 2 r identities in C can be found from the 
linear recurring sequences defined by the homogeneous recurrence relation (1.1) of order r, 
because the generating function of {a n }„>o is A(t)/B(t) with B(t) = 1 — Y7j=iPj^ ■ The more 
important is that this method can be applied to any sequence possessing a rational generating 
function, or even the numerator is a formal power series. The latest case is the most interesting 
case in this section. We demonstrate it with some examples as follows. 

Example 4. For each nonnegative integer n, the Fine number f n is considered to be the 
number of rooted trees of order n with root of even degree. In [7], the generating function of 
Fine number sequence {f n }n>o is presented as 

v J 2t(t + 2) v ; 

Using Fad di Bruno 's formula, Chou, Hsu and Shiue [5] give the expressions 

L V ^ 

2t(t + 2) 



F(t) = — ^ 3 " 1 "- 1 ; (3.7) 



and 

_lA (-\) n - k (2k\ 

with /o = 1 and f\ = 0. Using Preposition 3.2 and equations (3.6) and (3.7), we may obtain 
an identity of Fine number f n for n > 1 as follows 

/„_! + 2/ n = -±- ( 2n ) = C n , (3.8) 

n + 1 \ n J 

where C n is the nth Catalan number. 

For any non-negative integer n, the Riordan number r n can be viewed as the number of tall 
bushes with n + 1 edges (see Bernhart [3]). Let R(t) = Y^=o r nt n be the generating function 
of Riordan numbers. As shown in [5], 

_ 1 + t -sr=2t=W 



with 



2t(l + t) 



/ ^ ^ t n ^ (2n-2k-2)\3 k 

Vl-2t-3t 2 = 1- > -> — -i- ttt ' ,.,,, . (3.10) 

Z^ 2 n ~ 1 ^ (n-k- l)\(n-2k )\k\ V ; 

n=l k=0 y ' v ; 



[3] also gives that 

1_ sr^( l - ] m(n+l\f2n-2m\ 
+ 1 ^ \ m J \ n — m J ' 



n + _ 

m=0 
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Using Preposition 3.2 and equations (3.9) and (3.10), we obtain an identity of Riordan number 
r n for n > 1 as follows 

rn±li 



1 VV (2n-2k)\3 k 

r n + r n -i-— 2^ r „_u|/ n _uxniy ^ 3 - n ^ 



2 n Z_^ ( n -jfe)!( n -2fc + l)!ife!" 

For any non-negatitive integer n, the central Delannoy number d n is defined by the number 
of lattice paths on the plane from (0,0) to (n,n) with steps (1,0), (0,1), and (1,1). Let 
D(t) = Yln=odnt n be the generating function of central Delannoy numbers. Then we have 
(see Bandcricr and Schwcr [2]) 

1 \/l - 6t + t 2 

D(t) = ; = ^ (3.12) 

k ; Vl - 6t + t 2 1 - 6t + 1 2 k ; 

and 



"-Eh 



i=0 



2 



Using the same method in [5], we have 



3 n £ n ^ (-1) 



y/i - Q t + 1 2 = i + y £_L. y L^L (3.i3) 

v Z^2™- 1 ^3 2 * (n-i-1 )! (n - 2i) z v ; 

n=\ i=0 v ; v ; 

From Preposition 3.2 and equations (3.12) and (3.13), we obtain an identity of the central 
Delannoy number d n for n > 2 as 

f-1 

on l2 J / i \i 

d n - 6^ + d„„ 2 = 2^1 E 3 «(n-i-l)!(n-2i)!i!- (3 " 14) 

Note that the roots of 1 — 6t + t 2 are 3 ± \/2. So, one can obtain other two identities for the 
central Delannoy numbers using Preposition 3.2 and equations (3.12) and (3.13) similarly. 

4. Extensions 

In the last section, we will apply the following two techniques to derive more identities or 
to give hyperbolic expressions of identities from the results obtained in Sections 2 and 3. 

Proposition 4.1. Let sequence {a n } ra >o be defined by the linear recurrence relation (1.1) 
of order r and let the characteristic polynomial Pit) = t r — s ^ r j = iPjt r ~ :) have r roots aj 
(j = 1,2, ... ,r), where the root set may be multiset. Denote 

a 0') ;=a 0-i)_ aj _ ia 0-i) (2<j< r ) 
and a n := a n . Then there hold identities 

<$ ± a n+k - ^2pja n+k -j = a^ r+1 a { j\ (4.1) 



for any integer k satisfying n + k > r, where 
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r-1 

a n ) = «n - a n-l ^ «i + On-2 ^ ttjOj ~ On-3 ^ aiO>ja k 

i=\ l<i<j<r—l l<i<j<k<r-l 

+ ... + (_l)-i an _ r+1 n[- 1 1 a, (4.2) 

for n > r — 1. 

Example 5. In Section 2 (see the paragraphs after Corollary 2.2), we obtain two identities 
for Pell number sequence {P n }n>o, which has the characteristic polynomial t 2 — It — 1. From 
Proposition 4.1, for k = 1, we immediately have identities: 

Pn+1 - Pn - (2 + >/2)P„-l = (1 " V2) n -\ 
Pn+1 -Pn~(2- V^)P„-1 = (1 + V^)"" 1 . 

Similarly, for Jacobsthal number sequence { J n }n>o and Fibonacci number sequence {F n } n >o, 
there hold identities: 

Jn+i — 4J n -i = (—1)™ , 

T — T — o n ~l 

J-n+l J n—1 — ^ t 

Pn+1 j— Pn-1 - [—^~ ) ' 

F 3 ~^ F _ ^ 1 + ^ V' 1 

Pn+1 j— Pn-1 - ^^^ J " 

From [10], let a {a n } n >o be linear recurring sequence of order 2 satisfying the linear recur- 
rence relation, 

a n = pa n -i + qa n ^2- (4.3) 

and denote by a and f3 the two roots of the characteristic polynomial p(t) = t 2 — pt — q, then 

a n = { {**¥) ^ ~ (^ ) ^ ifa ^^ (4.4) 

[ na\a> n — (n — l)ao<x n , if a = (3. 

Inspired by [1, 4, 11, 13], denote 

a(9) = ^qe , (3(9) = -^je' 9 (4.5) 

for some real or complex number 6, where q > 0. For the case of q < 0, we denote 

(n,(R\ MOW -I (v /= ^e e ,v /= &e- e ) forp>0, 

for some real or complex number 9, and the remaining process is similar, which we leave for 
the interested readers. From (4.5) we may have 

p(9) = 2^sinh(#) (4.6) 

and define a parametric expression of {a n } n >o as 
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a n (9) = 2 v /gsinh(0)a n _ 1 (0) + qa n ^ 2 (9), a o (0) = a , a ± (9) = ^^ sinh#. (4.7) 

p 



Obviously, if 



'-"■"(vs,!- 



{an(0)}n>o is reduced to {a n }„> . 

Substituting expressions (4.5) into (4.4), we obtain 

a n (0) 

(n-i)l2 a ^ nd ~ (-l)"e-" e ) + y/qaojeW + (-l)" e -("-i)g) 
q e e + e- s 

J g c"osh0 ( Q i s inn "-^ + y^Qo cosh(n - 1)0) , if n is even; , . 

I 9 C osh6> ( Gl cosn n ^ + \/^ ao s i nn ( n ~~ 1)0) > */ n ^ s °^- 

Some properties and extensions of {a n (9)} n >o can be derived from (4.9). For instance, from 
the first equation of (4.9) and using r = — e~ 2e , we have 

\ 1 — v \ — r 

which enables us to extend the definition of a n (9) to nonpositive values of n. 

Since a(9) and (3(9) shown in (4.5) are two roots of the characteristic polynomial equation 
x 2 — p(9)x — q = 0, we may write (4.3) as 

a n (9) = (a(9) + (3(9))a n ^(9) - a(9)(3(9)a n - 2 (e), (4.10) 

where a(9) and /3(d) satisfy a(9) + (3(9) = p(9) and a(9)/3(9) = -q. Therefore, from (4.10), 
we have 

a n (9) - a(9)a n - 1 (9) = (3(9)(a n -i(e) - a(0)a n _ 2 (0)), (4.11) 

which implies 

Proposition 4.2. A sequence {a n (9)} n >o of order 2 satisfies the linear recurrence relation 
(4-3) if and only if it satisfies the non-homogeneous linear recurrence relation of order 1 with 
the form 

a n (9) = a(9)a n ^(9) + d(9)^ n - 1 (9), (4.12) 

where d(9) is uniquely determined. 

Proof. The necessity is clearly from (4.11). We now prove sufficiency. If the sequence {a n (9)} n >o 
satisfies the non-homogeneous recurrence relation of order 1 shown in (4.12), then by substi- 
tuting n = 1 into the above equation we obtain d(9) = a\(9) — a(9)ao- Thus, (4.12) can be 
written as 

a n (9) - a(9)a n - 1 (9) = (a ± (9) - a(9)a Q (9))p n -\9), (4.13) 

which yields 
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a n _i(0) - a(9)a n . 2 (9) = (ai(0) - a(0)ao(#))/3"- 2 (fl). (4.14) 

Multiplying the both sides of (4.14) by (3(9) and evaluating the difference of the resulting 
equation and (4.13), one immediately knows that {a n (9)} n >o satisfies the linear recurrence 
relation of order 2: a n {9) = p(9)a n - 1 (9)+ qa n - 2 (9) with p(9) = a{9) +(3(9) and q = -a(9)(3(9). 

□ 

Example 6. As an example, we may consider the parametric Fibonacci numbers defined by 

F n (9) = 2smh(9)F n ^ 1 (9) + F n ^ 2 (9), F = 0,F 1 = 2smh(9). 

Here a(9) = e e and (3(6) = —e~ e . From (4.12) there holds an identity for the parametric 
Fibonacci numbers 



or equivalently, 
Similarly, we have 



F n (9) = e e F n ^(9) + 2(-l)™- 1 sinh^e"^- 1 ) 6 , 
-e- F n (9) +F n _i(0) = 2(-l)"e-" e sinh(0). 



e y F n (9) + F n _i(0) = 2e ny sinh(9). 
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Abstract 

Sensitivity analysis for generalized setvalued variational inclusions based 
on (A, if) resolvent operator technique is investigated. The obtained re- 
sults encompass a broad range of results. 

Keywords: Sensitivity analysis, generalized setvalued variational inclu- 
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1 Introduction 

There exists a vast literature on the approximation solvability of nonlinear vari- 
ational inequalities as well as nonlinear variational inclusions using projection 
type methods, resolvent operator type methods or averaging techniques. In 
most of the resolvent operator method, the maximal monotonicity has played 
a key role, more recently the notions of A-monotonicity and ii-monotonicity 
have not only generalized the maximal monotonicity, but gave a new edge to 
resolvent operator methods. In [12] the author generalized the recent notion of 
A-monotonicity to the case of (A, ?7)-monotonicity while examining the sensi- 
tivity analysis for a class of non-linear variational inclusion problems based on 
the generalized resolvent technique. Resolvent operator techniques have been 
in use for a while in literature, especially with the general framework involving 
set valued maximal monotone mapping, but it got a new empowerment by the 
recent development of A-monotonicity and if -monotonicity, see [2, 3, 4, 8, 17]. 
Dafermos [1] studied the sensitivity property of solution of a particular kinds 
of variational inequality on a parameter which takes values on an open subset 
of Euclidean space R K . Tobin [14], Verma [15, 20], Salahuddin [11], Salahud- 
din and Khan [7], Kyparisis [7], Robinson [10], Moudafi [9], Kimura and Yao 
[5, 6] and Yen and Lee [23] studied the sensitivity analysis of various type of 
variational inequalities. In this paper, we present the sensitivity analysis for 
(^4, 77)-monotonc variational inclusion based on the generalized (A, ^-resolvent 
operator technique. The notion of (A, ?7)-monotonc mappings upgrades the no- 
tion of A-monotonicity [16, 18], which generalized the well-known class of max- 
imal monotone mappings to maximal relaxed monotone mappings [12]. The 
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obtained results generalize a wide range of results on the sensitivity analysis for 
sctvalucd variational inclusion. 



2 (A, ?7)-Monotonicity 



We intend in this section, to explore some basic properties derived from the 
notion of (A, 77)-monotonicity. Let X denote a real Hilbert space with the norm 
|| • || and inner product (•,•). Let )j:XxX->Jbea single- valued mapping. 
The map rj is called r-Lipschitz continuous if there exists a constant r > such 
that 

||^(w,v)|| < t||i4 — v\\, Vu,»£l. 

Definition 2.1 [21] Let ?):IxI->I be a single-valued mapping and M : 
X — > 2 X be a multivalued mapping on X . The map M is said to be 

(i) (r, rj) -strongly monotone if 

(u* — v* , n(u, v)) > r\\u — u|| , V(u,u*),(v,v*) E Graph(M); 

(ii) rj-pseudomonotone if 

(v*, n{u,v)) > 0, 

implies 

(u*,T](u,v)) > 0, V(w, u*), (v, v*) € Graph(M); 

(Hi) (m, 77) -relaxed monotone if there exists a positive constant m such that 
(u* — v*, r](u, v)) > (— m)\\u — w|| 2 , V(u,u*),(v,v*) € Graph(M). 

(iv) rj-firmly nonexpansive if 

\\ u — v\\ < (u* — v* , T)(u, v)) V(u, u*), (v, v*) e Graph(M). 

Definition 2.2 A mapping M : X — >• 2 X is said to be maximal (m, rj) -relaxed 
monotone if 

(i) M is (M, 77) monotone; 

(ii) for (u,U*) G X x X and 

(u* — v *, r](u, v)) > (— to)||w — v\\ , V (v,v*) G Graph(M); 

we have u* G M{u). 

Definition 2.3 Let A : X — >• X and n : X x X — >• X be two single-valued 
mappings, the map M : X — > 2 X is said to be (A, r/)-monotone if 

(i) M is (M, 77) relaxed monotone; 
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(ii) R(A + pM) = X for p>0. 

Note that alternatively, the map M : X — > 2 X is said to be (A, rj) -monotone if 

(i) M is (M , 77) -relaxed monotone; 

(ii) A + pM is rj-pseudomonotone for p > 0. 

Proposition 2.4 [22] Let A : X — >• X be an (r ,rj)- strongly monotone single- 
valued mapping and M : X — > 2 X be an (A, rj) -monotone mapping. Let rj : 
X x X — > X be T-Lipschitz continuous single-valued mapping. Then M is 
maximal (m, rj) -relaxed monotone and {A + pM)X — X for < p < —. 

Proposition 2.5 [22] Let A : X — > X be an (r, 77) -strongly monotone single- 
valued mapping and M : X — > 2 X be an (A, rj) -monotone mapping. In addition, 
let r\ : X x X — > X be T-Lipschitz continuous. Then (A + pM) is maximal 
rj-monotone for < p < — . 

Proposition 2.6 [22] Let A : X — > X be an (r,r])~ strongly monotone mapping 
and M : X — > 2 X be an (A, rj) -monotone mapping. If in addition, 77 : X x X — > 
X is T-Lipschitz continuous, then the operator (A + pM)^ 1 is single-valued for 

Definition 2.7 Let A : X — > X be an (r,r\)- strongly monotone mapping and 
M : X — > 2 X be an (A, rf) -monotone mapping. Then the generalized resolvent 
operator J^' A '■ X — > X is defined by 

Jl™{u) = (A + pM)-\u). 

Definition 2.8 The map T:lxIxL->I is said to be r-strongly monotone 
with respect to A in the first argument if there exists a positive constant r such 
that 

(T(x,u,\)-T(y,u,\), A(x)-A(y)) > r||a;-y|| 2 , V (x,y,u,X) G XxXxXxL. 

Proposition 2.9 [22] Let r\: XxX^rXbea T-Lipschitz continuous map- 
ping, A : X — > X be an (r, 77) -strongly monotone mapping, and M : X — > 2 X 
be an (A, rj) -monotone mapping. Then the resolvent operator HP' A : X — > X is 
{^- — )-Lipschitz continuous that is 

\\ J p'a( u ) - J p'a ( v )\\ ^ \\ u - v \\, for all u,veX. 

' ! H ' r — pm 

Definition 2.10 The map f : X x L — > X is said to be (p,q) -relaxed cocoercive 
if there exist constants p and q such that 

(/(«, A) - f(v, A), u -v)> -p\\f(u, A) - f(v, A)|| 2 + q\\u - v\\ 2 , 
for all (u, v, A) <G X x X x L. 
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3 Sensitivity Analysis 

Let T, V, Q : X x L — > 2 X be three setvalued mappings, N: XxXxL^-X 
a nonlinear mapping and M : X x X x L — > 2 X be an A-monotone mapping 
with respect to first mapping T, where L is a nonempty open subset of X. 
Furthermore let -q : X x X — >• X be a nonlinear mapping and / : X x L —$■ X 
be a map then the problem of finding an element u € X, x € T(u,X), y G 
V(u, A), w £ Q{u, A) such that 

f(u,X)€N(x,y,X) + M(u,w,X) (3.1) 

where A £ L is the setvalued parameter is called a generalized setvalued varia- 
tional inclusions. 

The solvability of problem (3.1) depends on the equivalence between (3.1) 
and the problem of finding the fixed point of the associated generalized resolvent 
operator. 



Note that if M is (A, ?7)-monotone, then the corresponding generalized 
resolvent operator J q p \ in the first argument is defined by 

J^ { -' w \u) = (A + pM(.,w))- 1 (u), VueX, (3.2) 

where p > and A is an (r, ?y)-strongly monotone mapping. 

Lemma 3.1 [22] Let X be a real Hilbert space and r\: XxX^rXbea 
r-Lipschitz continuous nonlinear mapping. Let A : X — > X be (r,r/) -strongly 
monotone and M:XxXxL^r 2 X be (A, r/) -monotone in the first variable 
with m. Then the generalized resolvent operator associated with M(., v, A) for a 
fixed y G X and defined by 

jri,M(., v ,X) {u) = {A + pM{ ^ A)) -l (u)) y ueX , 

is (—^ — ,77)- firmly nonexpansive. 

Lemma 3.2 Let X be a real Hilbert space, T,V,Q:X^>2 X be the set valued 
mappings, let A : X — > X be (r, r/) -strongly monotone and M : X x X x L — > 2 X 
be (A, rj) -monotone in the first variable. Let f : X x L — > X be a single valued 
mapping. Let r\:XxX^rXbea r-Lipschitz continuous nonlinear mapping 
then the following statements are mutually equivalent: 

(i) An element u € X, x € T(u, A), y€ V(u, A), w € Q(u, A) 
is a solution to (3.1), 

(ii) the map G : X x L — > X defined by 

G(u,X) = j;,f { -' w ' X) (A(u) -pN(x,y,X) + pf(u,X)) 

has a fixed point. 
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Proof. Let u e X, x e T(u, A), y e V(u, A), w G Q(u, A) be a solution of (3.1). 

(3.1) <*=*> /(«, A) G JV(x, y, A) + M («, to, A) 

G iV(x, y, A) - f(u, A) + M(u, w, A) 

G pN(x, y, A) - p/(u, A) + pM(u, w, A) 

G -4(u) + pAT(x, y, A) - p/(m, A) + A(u) + pM(u, w, A) 

G -(i4(«) - /t>JV(x, y, A) + p/(w, A)) + (4 + pM (•, w, X)(u) 

u = (A + pM(-,w, A))~ V(«) - pN[x, y, A) + pf(u, A)) 

u = j;^ w ' X \A{u) - pN(x, y, A) + pf(u, A)) 

From the Lemma 3.2, we see that (3.1) is equivalent to the fixed point problem 
of type 

u G G(u,X). 

Therefore 

G(u, A) = J^ { -' w ' x \A{u) - pN(x, y, A) + pf(u, A)) 

■ 

Theorem 3.3 Let X be a real Hilbert space and r\ : X xX — > X be a t -Lipschitz 
continuous nonlinear mapping. Let A : X — > X be (r,r/)- strongly monotone and 
s-Lipschitz continuous and M : X x X x L — > 2 X be (A, n) -monotone in the 
first variable with m. Let X ', V ]Q : X x L — > 2 X be the setvalued mappings and 
'H-Lipschitz continuous with 7, <r, £ constants respectively. Let f : X x L — > X 
be the (p,q)-relaxed cocoercive and Lipschitz continuous with constant C,. Let 
J\f:IxIxL-)I be a-strongly monotone with respect to A and T and 
f3-Lipschitz continuous in the first variable and let N be p-Lipschitz continuous 
in the second variables, and k-relaxed monotone with respect to V. If in addition 

II T TI,M(.,Wl,X) ( N jri,M(.,W 2 ,\) / \|| ^- r| || 

\\ J p,A ( z )- J p,A ' {Z)\\ < 5\\W! - W2\\ 

J 

V (u, v, A) G X x X x L, wi G Q(u, A), w 2 G Q(v, A). (3.3) 

TTien 

||G(w,A) -G(v,A)|| < 0||u-v||, V («,u,A) G X x X x X, (3.4) 



«,'/? 



( -/■(■: 



— - — [Vs 2 - 2 pa + p 2 (3 2 -/ 2 + pfl] +t<l,£l = B + D,t = 6£ 

r — pra 

£ = v / (l-2<z) + C 2 (l + 2p) 
D = ^/l + 2k + p 2 a 2 , 
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< 



ar 2 - r{\ - t)(m(l -t) + fir) 
/3 2 7 2 r 2 - (m(l - £) + fir) 2 

sJ(aT 2 - r{\ - t)(m(l - t) + fir)) 2 - (t 2 s 2 - r 2 (l - £) 2 )(^ 2 7 2 r 2 - (m(l - t) + fir) 2 ) 

/3 2 7 2 r 2 - (m(l - i) + fir) 2 

ar 2 > r(l-t)(m(l-t)+fir)+v/(r 2 s 2 - r 2 (l - £) 2 )(/3 2 7 2 r 2 - (m(l - i) + fir) 2 ) 

ar 2 > r(l - i)(m(l - t) + fir) 

rs > r(l - t) 

,3 7 r > m(l - t) + fir 

£< 1. 

Consequently for each AgL, £/ie mapping G(u, A) in £/ie %/rf ^5.^ ftas a 
unique fixed point z{\). Hence in light of Lemma 3.2, z(X) is a unique solution 
to (3.1). Thus we have 

G{z(X),X) = z(X). 

Proof. For any elements (u, v, X) e X x X x L and x\ <G T(m, A), ^2 € T(w, A), 
j/i e V(u, A), j/2 e V(u, A),ioi € <5(m, A),W2 e Q(u, A), we have 

||G(«,A)-G( V ,A)|| = ||J^ , f ( - ,0l ' A) (^(ti)-pJV(a : i ) yi,A)+p/(«,A)) 

-j;,f ( -' W2 ' A) (^)-^(^,y2,A) + /9 /( V) A))|| 

< ||J£f ^'^(li) - P N(x u yi,X) + pf(u,X)] j;<^ X) [A{v) 

-pN(x 2 ,y2,X) + pf{v,X)}\\ 

+ p ,,M(., Wl ,A )[A{v) pN{x2 ^ A) + p/(V) A)] _ jV : M { ., W2 ,X )[A{v) 

-pN(x2,y2,X) + pf(v,X)}\\ 

< -^—\\A(u)-A(v)-p(N(x 1 ,y 1 ,X)-N(x2,y2,X))+p(f(u, X)-f(v, X))\\+S\\w 1 -w 2 \\ 

r — pm 

< -^— [\\A(u) - A(v) - p(N( Xl , yi ,X) - N(x2,yi,X))\\ 
r — pm 

+p\\ - N(x 2 , Vl ,X) + N(x 2 ,y 2 ,X) + /(«, A) - f(v, A)||] + *||u>i - w 2 \\ 

< [\\A{u)-A(v)-p(N(xi,yuX)-N(x2,yuX))\\+p\\u-v-(N(x2,yi,X) 

r — pm 

-N(X2, 2/2, A))|| +p\\u-v- (/(«, A) - /( V) A))||] + 5||wi - u; 2 ||. (3.5) 

Since T, V, Q are "H-Lipschitz continuous and N is Lipschitz continuous with 
respect to first and second argument, / is Lipschitz continuous, we have 

\\N{xi,y u X)-N(x2,yi,X)\\ <P\\xi-x 2 \\ 

<m{T{u,X),T{v,X)) 
<h\\u-v\\, (3.6) 



297 



ANASTASSIOU ET AL: SET VALUED VARIATIONAL INCLUSIONS 



\\N(x 2 ,Vi, A) - N(x 2 , y 2 , A)|| < p\\ Vl - y 2 \\ 

<pU{V{u,X),V{v,X)) 
<pa\\u-v\\, (3.7) 

||/(«,A)-/(i; ) A)||<C||u-i4 (3.8) 

IK - w 2 \\ < H(Q(u, \),Q{v, A)) < £||u - v\\. (3.9) 

The a-strong monotonicity of N with respect to A,T and (3.6), we have 

\\A(u) - A(v) - p(N(xi,yi,X) - N(x 2 , yil X))\\ 2 = \\A(u) - A(v)\\ 2 - 2p(N(x 1 ,y 1 ,X) 

- N(x2,yi,\),A(u) - A(v)) 

+ p 2 \\N(x 1 ,y 1 ,X) - N(x 2 ,y l7 X)\\ 2 

< (s 2 -2pa + p 2 /3 2 y 2 )\\u-v\\ 2 . 

(3.10) 

By fc-relaxed monotonicity of N with respect to V , and (3.7), we have 
\\u-v-{N{x 2 ,yi,X)-N(x 2 ,y2, X))\\ 2 = \\u-v\\ 2 -2(N(x 2 , yil X)-N{x 2 , y 2 , X),u-v) 
+ \\N(x 2 , yi ,X)-N{x 2 ,y 2 ,X)\\ 2 
< \\u- v\\ 2 + 2k\\u- v\\ 2 + p 2 a 2 \\u-v\\ 2 

<{l + 2k + p 2 a 2 )\\u - v\\ 2 . (3.11) 

Again by (p,q)-rclaxed cocoercivity of f, and (3.8), we have 

\\u-v-(f(u, X)-f(v, A))|| 2 = \\u-v\\ 2 -2(f(u, X)-f(v, A), u - v )+\\f(u, X)-f(v, A)| 

< \\u - v\\ 2 + 2p\\f(u, A) - f(v, X)\\ 2 - 2q\\u -v\\ 2 + \\f(u, A) - f(v, A)|| 2 
< (1 - 2q)\\u - v\\ 2 + 2p( 2 \\u - v\\ 2 + ( 2 \\u - v\\ 2 

<((l-2q)+( 2 (l + 2p))\\u-v\\ 2 . (3.12) 

From (3.5), (3.9), (3.10), (3,11) and (3.12), we obtain 

\\G(u,X)-G(v,X)\\ <0\\u-v\\, 



2 



where 



(r — pm) 



[y/s 2 - 2pa + p 2 f3 2 ~f 2 +pQ]+t<l, (3.13) 



where 



and 



D = y/l + 2k + p 2 a 2 
B = ^(l-2q) + C 2 (l + 2p) 

fl = B + D,t = 6£. 

Since 6 < 1, it concludes the proof. ■ 
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Theorem 3.4 Let X be a real Hilbert space, A : X — >• X be (r,rj)- strongly 
monotone and s-Lipschitz continuous and M : X x X x L — > 2 X be (A,r))- 
monotone in the first variable with m. Let T,V,Q:XxL^2 x be the H Lips- 
chitz continuous with respect to 7, u, £. Let f : X x L — >• X be the (p, q)-relaxed 
cocoercive and Lipschitz continuous with constant £. Let N: XxXxL^X be 
a-strongly monotone with respect to T and A and /3-Lipschitz in the first vari- 
able and let N be p-Lipschitz continuous in the second variables, and k-relaxed 
monotone with respect to V. Furthermore let r\ : X x X — > X be a r -Lipschitz 
continuous. In addition if 

11 T r],M(. ,wi,\) 1 \ T n,M(.,W2,\) i \i| ,- ci| | 

\\Jp,A ( Z ) ~ J p,A Mil < °\m - W2\\, 

V (u,v, A) e X x X x L,w\ e Q(u,X),w 2 e Q(v,X). 

Then 

\\G(u,X)-G(v,X)\\<d\\u-v\\, V (u,v,X) e X x X x L, (3.14) 

where 

9= — - — Us 2 - 2pa + p 2 f3 2 - f 2 + P n]+t< l,Sl = B + D,t = 6£ 
r — pm 

B = ^(l-2q) + ( 2 (l + 2p) 
D = y/l + 2k + p?a 2 
ctT 2 -r(l-i)(ra(l-r) + Qr) 



p— 



< 



/3 2 7 2 r 2 _ ( m (! _ t ) + nT y 

^(ar 2 - r{\ - t)(m(l - t) + VLt)) 2 - (t 2 s 2 - r 2 (l - t) 2 )(^ 2 7 2 T 2 - (m(l - t) + fir) 2 ) 

^2 7 2 T 2 _ ( m ( X _ t ) + ci T y 

ar 2 > r{l-t){m{l-t)+nT)+yJ{T 2 s 2 - r 2 (l - t)2)(/3 2 7 2 r 2 - (m(l - t) + fir) 2 ) 

ut 2 > r(l - t)(m(l - i) + fir) 

ts > r(l - t) 

Pjt > m(l - t) + fir 

t< 1. 

7/ f/ie mapping A — > N(x,y,\) and A — > </^ , A — > T(u, A), A — > 

V^u, A), A — > f(u, A), A —J- Q(u, A) are continuous (or Lipschitz continuous) from 
L to X, then the solution z(A) of (3.1) is continuous (or Lipschitz continuous) 
from L to X . 
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Proof. From the hypothesis of theorem, for any A, A* e L, we have 

||z(A) - z(X*)\\ = \\G(z(X), A) - G(z(\*), A*)|| 

= \\G(z(X),X)-G(z(X*),X)\\ + \\G(z(X*),X)-G(z(X*),X*)\\ 
= e\\z(X) - z(X*)\\ + \\G(z(X*),X) - G(z(X*),X*)l 

It follows that 
\\G(z(X*),X)-G(z(X*),X*)\\ = 

||j;;r ( ""' WA * ) ' A) ' A) (^(A*))-^(x(^(A*),A),y(^(A*),A),A)+p/(^(A*),A)) 
_ j^(-M^nX)X) {A{z{y)) _ pN{x{z{xl) X *),y(z(X*), A*), A*)+p/(*(A*), A*))|| 

<ll^f ( -' tu(z(A * ) ^ ) ' A) (^(^(A*))-piV(x(z(A*),A), y (z(A*),A),A)+p/(z(A*),A)) 
-^;f ( - Mz(A * ) ' A) ' A) (^(A*))-^(x(z(A*),A*), 2 ;(z(A*),A*),A*)+ A /(z(A*),A*))|| 
+ l|j;f ( -'" (z(A * ) ' A) ' A) [^(A*))-pA r (^(^(A*),A*),y(z(A*),A*),A*)+ A /(z(A*),A*)] 
_ j , ; m(.,^(a*),a*),a- ) [^(A*))-pJV(a;^(A*), A*), y(«(A*), A*), X*)+pf(z(X*), X*)]\\ 

<— ?^l\\N(x(z(X*),X),y(z(X*),X),X)-N(x(z(X*),X*),y(z(X*),X*),X*)\\ 

r — pra 

+ \\f(z(X*),X)-f{z{X*),X*)\\] 

+ 11 J^-M^W) [A{z{X *))-pN(x(z(X*), A*), y(z(\*), A*), X*)+pf(z(X*), A*)] 

-j;J ( -' w(2(A * ) ' A * ) ' A * ) [^(A*))-^(a ; (^(A*),A*),^(A*),A*),A*)+ /0 /(^(A*),A*)]||. 
Hence, we have 

l|z(AKz(A * }11 " (r - (Z){\ - 6) ^ N (« X *^ A )'^ A *)' A )' A ) 

-N(x(z(X*),X*),y(z(X*),X*),X*)\\ + \\f(z(X*),X)-f(z(X*),X*)\\] 
l^.f ( -' Uj(z(A * KA) ' A) ^(^(A*))-^(x(z(A*),A*),j / (z(A*),A*)A*)+p/(z(A*),A*)] 



1 - 6»" "^ 

-^;f ( - Mz(A * KA * KA * ) [^(A*))-^(x(z(A*),A*),y(z(A*),A*),A*)+p/(z(A*),A*)]||. 
This completes the proof. ■ 
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Abstract 

We evaluate explicity certain quantities regarding the Brownian mo- 
tion process on the n-dimensional sphere of radius a. First we review the 
transition densities of the process. Then we calculate some probabilistic 
quantities (e.g. moments) of the exit times of specific domains. 
Key word and phrases: n-dimensional sphere, stereographic projection 
coordinates, Brownian motion, exit times, transition densities, reflection 
principle 



1 Introduction 

1.1 The n-Sphere 

Let nGN={l,2, 3, ...}. The n-dimensional sphere 5™ with center (c\, ..., c n+ i) 
and radius a > is the set of all points x € R n+1 satisfying 

(xi - ci) 2 H h (x„+i - c„ + i) 2 = a 2 . 

The most interesting case in applications is, of course, the case n = 2. For the 
sake of comparison we will also discuss the cases n = 1 (i.e. the circle) and 
n = 3. In some cases we will even consider the case of general n. 

1.2 Stereographic Projection Coordinates 

Consider the n-sphere, n > 2, 

2 i i 2 , / \2 2 

Xi + ■ ■ ■ + X n + (x n+ i — a) = a . 

To each point (xi, ..., x n , x n+ i) of this sphere, other than its "north pole" N = 
(0, ...,0,2a) we associate the coordinates 

1ax\ 2ax n 

2a - x n+1 la - a;„ + i 
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Given the coordinates (£1, ...,£ n ) of a point on the sphere with Cartesian coor- 
dinates (xi, ■■■,x n , x n +i), we have 



Xi = 



4a 2 6 



4a 2 £„ 



e i 2 + ---+e 2 +4a 2 ' 



C 2 + ---+a+4a 2 



X n +1 = 



2a(e 1 + ---+e n ) 

C 2 + ---+a+4a 2 ' 



1.3 Spherical Coordinates 

The points of the n-sphcre 



x x + ■ ■ • + x n + x n+1 = a 
may also be described in spherical coordinates (#1, ..., 9 n -i, (p) as follows: 

• For n — 1, x\ = acosip, x 2 = as'mtp, where < ip < 2tt. 

• In general for n > 2 

xi = acos9iY\™ =1 smdi, x 2 = anr=2 sm ^> x k = acos# fe _i n™=fc sm #ij 

for fc = 3,4, ...,n 

and x„ + i = acos9 n , where < 9\ < 2-7T, < Si < 7r, 

for i = 2,3, ...,n, 

1.4 The Laplace-Beltrami Operator 

In spherical coordinates: The Laplacc-Bcltrami operator of a smooth function 

i a 2 / 



f on S 1 is 



Ai/ = 



a 2 <9y 2 
The Laplace-Beltrami operator of a smooth function f on S 2 is 

1 ( fee 



A 2 / 



a z sm ip \sm ip 



(1.1) 



(1.2) 



The Laplace-Beltrami operator of a smooth function / on S 3, is 



A 3 / = 



1 



9 • 2 

a z sm ip 



sin 2 tf 2 5fl 2 ' sintf 2 90 2 V^ 2 ""^J + ^ {dtp*" 1 * 



i a 2 / , 1 d ( df 

h'd6~2 



(1.3) 
In stereographic projection coordinates: The Laplace-Beltrami operator of a 

2(n-2) 



smooth function / on S n , n > 2 is 



A„/ 



(£ 2 + ---+q+4a 2 )- 
16a 4 



Y^a 2 / 



Ee 



9/ 



^sc 2 (e 1 2 + --- + e 2 + 4a 2 )^-9e 



In particular, for n — 2 we get 



a 2/ = fe 2 + e 2 2 + 4« 2 ) 2 fs 2 / + a 2 / 



16a 4 V^ 2 5£ 2 



(1.4) 
(1.5) 
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1.5 Brownian Motion on S n 

The Brownian motion on S n , starting from x <E S n , is a diffusion (Markov) 
process X t ,t > 0, on S n whose transition density is a function P(t,x,y) on 
(0, oo) x 5™ x S n satisfying 

dP 1 

V = n A nP (1.6) 

P{t,x,y)^5 x (y) as t -> 0+ (1.7) 

where A„ is the Laplace-Bcltrami operator of S*™ acting on the a;-variables and 
S x (y) is the delta mass at x, i.e. P(t,x,y) is the heat kernel of S n . The heat 
kernel exists, it is unique, positive, and smooth in (t,x,y) [2]. 

1.5.1 Further Properties of the Heat Kernel P(t,x,y) 

It is well known that P(t, x, y) satisfies the following properties [2] 

1. Symmetry: P(t,x,y) = P(t,y,x). 

2. The semigroup identity: For any s G (0, i), 

P(t,x,y)= / P(s,x,z)P(t- s,z,y)d/j,(z) 

where dfj, is the n-th dimensional surface area. 

3. For alii > and x E S n 

f P(t,x,y)dfi{y) = l. 

4. As i — >• oo, P(t, x, y) approaches the uniform density on S n , i.e. 

lim P(t,x,y) = — , 

where A n is nth dimensional surface area of S n with radius a. It is well 
known that [3] 

2 n k + i a 2k+i 2 2k (k-l)\n k a 2k 

A2k+1 = (fc)! ' and Mk= (2fe-l)! "' fcGN 

Finally, the symmetry of S n implies that P(t, x, y) depends only on i and d{x, y), 
the distance between x and y. Thus in spherical coordinates it depends on i 
and the angle ip between x and y. Hence 

P(t,x,y) =p(t,ip), 

where p(t, ip) satisfies 



dp _ 1 . 1 

— - -A nP - ^ 



dp dp 



(n — 1) cot ip ■ — h 



dip dip' 



(1.8) 



and 



lim aA n _ip(t, <p) ■ sin™ l ip = S(tp). (1.9) 



Here <5(-) is the standard Dirac delta function on 
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2 Explicit Form of the Heat Kernel 

Reminder (Poisson Summation Formula). Let f(x) be a function in the 
Schwartz space <S(R), where <S(R) consists of the set of all infinitely differentiablc 
functions / on K so that / and all its derivatives /^ are rapidly decreasing, in 
the sense that 



sup l^l" f (l) (x) 



rfel 



< oo for every k, I > 0. 



Then 



raSZ nSZ 

where -F(£) is the Fourier transform of f(x), i.e. 

^+00 



^(0 



/(x) exp(— i£x)dx, £ G 



2.1 The Case of S 1 

Proposition 2.1 The transition density function of the Brownian motion X tl t > 
on S 11 ot£A radius a is the function 



M^) = ^— I>xp 



nGZ 



n 2 i 

2 or y 7ra 



rn ^ — ' 



».ef 



n t 

ox]) ( _ 2^ ) cos ( n ^) 



equivalently 






1 

2-7ra' 
(2.1) 

(2.2) 



For the proof see [5] . 

2.2 The Case of S 2 

We remind the reader of Legendre Polynomials P n (x), n = 0, 1, 2, ... since 
we are going to use them later in the paper 

Proposition 2.2 77ie transition density function of the Brownian motion X t , 
t > on S 2 with radius a is given by the function 



p(t, <p) = - — 2 Yl ( 2n + x ) cx p 



n(n + l)Vi 



P n (cos<p). (2.3) 



For the proof see [1] or [5]. 
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2.3 The Case of S 3 

Proposition 2.3 Let X tl t > be the Brownian motion on a 3- dimensional 
sphere S 3 of radius a. The transition density function p(t,<p) of X t is given by 

v ; r nSZ v ' 

where Z zs the set of all integers. Equivalently, 

i v-^ ( t( n2 — 1) \ 

P(i '^ ) = -4^^in^^ neXP ( 2^+^V' (2 ' 5) 

/ x 1 v- . / x / £(n 2 - 1)\ 

pit, y ) = _. o . > nsin(n<p)exp — -= — . (2.6) 

y ' 2n 2 a 3 sm(p ^ \ -r i r \ 2a? I 

The function p(t, ip) is analytic at ip = and </? = 7r. In fact 

p(t, 0) = hm P^J^^E"^ 2^ 

and 

t(n 2 - 1) 



p(t, 7r) = lim_ p(t, <p) = 3 J^ n 2 (-l)™ cxp I - 



27r 2 a 3 ^ v ' " V 2a 2 

For the proof see [5] . 

Reminder. The $3 function of Jacobi is 

00 
$3(z, r) = 1 + 2 >^ exp (z7rrn 2 ) cos(2n2:), 

where r € C, with Im{r} > 0. It follows that 

P{t > V) = -4^3 sin ^ CX P (^J ^ 3 ( 2 ' 2a^J ' 

3 Expectations of Exit Times 

Let X t be the Brownian motion in S n and D a Borel subset of S n . The random 
variable 

T = inf{t > 0| X t i D} 

is called the (first) exit time of D. 

Reminder. If u(x) = E X [T], then u(x) satisfies 

^A n u = -1, u\ dD = (3.1) 

(see, e.g., [8]) 
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Proposition 3.1 We consider the 2 -dimensional sphere S 2 of radius a. Let 
two circles pass through the North pole, such that in stereographic coordinates 
are represented by the parallel lines 6 = b and 6 = c, where b,c € K, say 
b < c. We consider the set D in S 2 , whose stereographic projection is 

£> = {(6,6)l6eR and &e(6,c)}. 



Of course 



dD = {(6,6) I £1 € R and £ 2 = b or 6 = c} . 



// Xj is i/ie Brownian motion on S 2 of radius a starting at the point A, where 
the stereogrpaphic projection coordinates of A are (6:6) € -D and 



then 
where 

/(6,6) 



T = inf{i> 0|X t e D}, 
E A [T] = /(6, 6) - 2a 2 In (6 2 + £ + 4a 2 ) , 



(3.2) 



1 

7r./„ 



ff(r? , c )exp(^)sin(^) 



exp 



c-6 



2 ( irjb-b) 
c-b 



^]> l ^)cos(^|^ j + / / 



rf;/ 



^,6)exp(^)sin(«) 



exp 



and 

Proof 

The function 



(Sfc) ^ (^) + (exp (£) cos (* 
(c-b) 2 ln 2 |e| 



(I2-6) 



3-6 



;(6*) = 2a 2 In 



t 2 + 4a 2 



d?/ 
(3-3) 

(3.4) 



£ A [T] = t/(6,6) 



satisfies the differential equation 



:A 2 U = -1 



with boundary conditions 



U(Si,b) = Ufa,c) = 0. 



Here A2 is the Laplace-Beltrami operator on S 2 expressed in the stereo- 
graphic projection coordinates (see (1.5)). Hence the differential equation takes 
the form 

d 2 U d 2 U _ 32a 4 

^r + M __ (6 2 +6 2 + 4a 2 ) 2 ' ( ' 
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However the function 

f/i(£i,6)=-2a 2 ln(^+£ 2 2 +4a 2 ) 

satisfies the differential equation (3.5). Thus 

t/(6,6) = -2a 2 ln(C 2 + el +4a 2 ) + /(6,6), (3.6) 

where /(^i,^2) satisfies 

with boundary conditions 

/(£i,6) = 2a 2 ln(£ 2 + o 2 +4a 2 ) and /(&, c) = 2a 2 ln(£ 2 + c 2 + 4a 2 ). (3.7) 

If we make the change of variables x = £i and y = £,2 ~ b and set the 
function <fi(x,y) = / (^1 , ^2 ) ? then 4>{x,y) satisfies 

^1 + ^1=0 

dx 2 dy 2 

with boundary conditions 

4>(x,0) = 2a 2 \n(x 2 + b 2 +4a 2 ) and 0(x, /3) = 2a 2 ln(x 2 + c 2 + 4a 2 ), 

where /3 = c — 6. 

Now let z = x+yi and iu = exp ( ■¥ ) , i.e. z — nw . Thus, if w = u+vi, u, w € 

R then 

firx\ ( ny\ f irx\ . f iry\ , . 

" = cxp I — I cos I — I and t> = exp I — 1 sin I — 1 . (3.8) 

Introducing the function ip(u,v) — <p(x,y), it follows that ip(u,v) satisfies 

S 2 ^ 9 2 '0 



9 m 2 9 v 2 
with boundary conditions 



= 0. 



^(u, 0) = 2a 2 In ( ^ ln u + b 2 + 4fl 2 ) for u > , 



and 

?2i„2i 



V^,0) = 2a 2 ln( ^ ln J M| +c 2 +4a 2 ), for M <0 



?v 



This is the standard Dirichlct boundary value problem for the half plane and it 
is well known [6] that its solution is given by the Poisson integral formula for 
the half-plane: 

1 r°° W>(g,o) 

K J -co V 2 + (U- £) 2 

or 

,< \ l f° v ^ c ) it 1 X r t;g($,&) ^ 
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where 



g& t) = 2a 2 In (^^ + 1 2 + 4aA . 



Notice that g(— £, t) = g(£,t). Hence 

g(g,c) 

„2 



ip(u,v) = 



1 

— n 



fOC 



y7T Jo \v 2 + (u + £,) 2 v 2 + 



where u,v are given in (3.8). Therefore 



$w* 



«x,y) = ±ew(f)*n(j)J o 



OO 



g(v,c) 




g(v,c) 

v exp(^)sin 2 (f) + (cxp(^ 

g(v, b) 



f Jcosl 



?)+*) 



i.e. 



i r 00 
/&,&) = -/ 



,-P (^ 



O ) sin2 (?) + ( CX P (f 
g fac)exp(git) S in(^) 



\ 



(^)cos(f)-,) J 



dt], 



I - 2 (^ 



(cxp(^)cos v , 



c-b J 



■v) 



dt] 



n Jo 



■DO 



Therefore 



expfe 



g(v,b)exp\^J si 



^)sin 2 (^>) 



sin(^ 



(cxp(^)cosl 



^H) 2 . 



d/t). 



E A [T] - /(&,&) - 2a 2 In (£ + £ 2 + 4a 2 



4 Hitting Probabilities 

Let X t be the Brownian motion in S n , D C S n , and T its exit time. 
Reminder. If T C dD and m(x) = P X {X T e T} then [4] u(x) satisfies 

A„u = 0, u| r = 1, u\dD\r = °- 

Proposition 4.1 We consider the 2 -dimensional sphere S 2 of radius a. Let two 
circles passing through the North Pole, such that in stereographic coordinates are 
represented by the parallel lines £2 = b and £2 = C, where b, c G R, with b < c. 
We consider the sets Di,D2 in S 2 , whose stereographic projection are 

A = {(&,&) I &eR,6e(6,+oo)} and £>2 = {(£i,6)l £1 eR,6 € (-oo,c)}. 

0/ course, 
dD 1 = {{Z 1 ,£ 2 )\SieR,S2 = b} and 8D 2 = {(&,&) | £1 € R,& = c} . 
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Let X t is the Brownian motion on S 2 of radius a starting at the point 
A, where the stereographic projection coordinate of A are 

(a,6)e£>in£> 2 . 

T 1 =w£{t>0\X t $D 1 },T 2 = inf {t >0\X t ^D 2 } 
T = inf {t > | X t $ £>i n D 2 } , 



// 

and 

then 



P A {T = T 1 }= C —^- and P A {T = T 2 } = ^—^ . (4.1) 

c — b c— b 



Proof 

The function 

u(Z 1 ,Z 2 )=P A {T = T 1 } 

is the unique solution of the differential equation 



2 < A,*-", (4-2) 



or (see (1.5)) 

d 2 u d 2 u 

with boundary conditions 

u(^i,6) = l and u(Ci,c)=0. (4.3) 

Since (4.2)-(4.3) has a unique solution, (4.1) follows immediately. ■ 

Remark 4.1 In stereographic coordinates a function is harmonic with respect 
to A2, (the Laplace- Beltrami operator of S 2 ), if and only if it is harmonic 
with respect to the standard Euclidean Laplacian. This fact is not true for 
S n ,n>3. 

5 Reflection Principle on S n and Applications 

The following notation will be used in Theorem 5.1. 



Definition 5.1 For every A — (xi, x 2l . . . , x n+ i) G S n we denote by A the 
point (Xi,X 2 , ■ ■ ■ ,— X n +i) <= S n , namely the symmetric of A with respect 
to the (xi,x 2l . . . ,x n )-hyperplane. 

Theorem 5.1 Let X t , t > be the Brownian motion on a n-dimensional 
sphere (n>2), S n of radius a starting at the point 

A = {6 1 ,...,e n , v )eD, 
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where 

D = {(0 1 ,...,e n _ 1 ,<p)€S n \ 1 e[O,27r),0 i e[O,7r] for i = 2,...,n-l and ¥>e(|,7r]} 

T = inf{i>0| X t <£ D}, 

then 

P A {T < t} = 2P A {X t £ D] . (5.1) 

Proof. 

P A {T <t} = P A {T < t, X t i D} + P A {T <t,X t eD}. (5.2) 

However, if X t £ D then of course T < t. Thus, 

P A {T <t,X t £D} = P A {X t £ D} . (5.3) 

On the other hand, if we set 

X t , if t<T; 



Xl ' X t , if t>T, 

where X t is given by Definition 5.1, then by the strong Markov property of 
X t , X t and X t have the same law. Hence, 

P A {T <t,X t eD} = P A {r < t, X t € £>} , 

but if X t £ D then X t £ D. Hence, 

P A {t <(,i f efl} = P A {T <t,X t £D}, 

or 

P A {r<t,X t £])} =P A {X t £D}. (5.4) 

Therefore from (5.2), (5.3) and (5.4) we obtain that 

P A {T <t} = 2P A {X t £ D} . 

m 

In the case of S* 1 we can prove the next result in a similar manner. 

Theorem 5.2 Let X t , t > be the Brownian motion on a 1- dimensional 
sphere S 1 of radius a starting at the point ip £ D, where 



If 
then 



D = (7r,2vr). 

T = inf{i>0| X t i D}, 
P^ {T <t} = 2P V {X t i D} . (5.5) 

10 
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5.0.1 Applications of the Reflection Principle 

The reflection principle can help to calculate the distribution functions of cer- 
tain exit times. 



The case of S 1 

Let X t be the Brownian motion on a 1-dimcnsional sphere S 1 of radius 
a starting at the point ip. If D = (ir, 2ir) , then 



/■7T fTV — <p 

P{X t (£D}= a-p(t,x-<p)dx= a- 

JO J-v 



p(t,y)dy, 



where p(t, p) is the transition density function of the Brownian motion on S 1 
of radius a. Hence, form (2.1) 



/-K-if r i / 
a — V exp 



■U) cos{ny) ) i 



dy, 



or 



1 1 



P{X t tD} = - 9 + -E 



a eK 



cxp 



n z t 
2d 2 



cos(ny)dy 



Therefore, 



''!^^ = 1 + :E 



2 7T 



nGN* 



exp 



n 2 t\ sin(ri7r — nip) + sm(ntp) 



2a 2 



n 



i.e. 



Thus 



''w^i^ + iE 



2 7T 



TieN* 



exp 



P{X t ^£»} = ^ + £ £ 



1 2 

2 + ^ 



n 2 £^ sin(np)(l- (-1)") 
"2^2 



cx p(~l?) sin ( n< />) 



n odd 



It follows (by using Theorem 5.2) that, if T = inf{i > 0| X t £ D}, then 



p v {t < t} = i + 4 V - 

7T fa ^ n 



nH 



, - /xp^-pinM. 

n odd 



(5.6) 



for every ip € (7r, 27r). 



The case of S 2 

Let Xj be the Brownian motion on a 2-dimensional sphere S 2 of radius 
a starting at the point N(0, 0) in spherical coordinates. If 

D = [(8,p)eS 2 \ ee[0,2n),pe (|,tt]} 



then 



P N {X t $D}= / p(t, <p)a 2 sm(tp)d6dp, 

Jo Jo 



11 
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i.e. 



P N {X t iD} = 2ira 2 / p(t, if) sm(<p)d<p, 
Jo 

where p(t, if) is the transition density function of the Brownian motion on 
5 2 of radius a. Hence from (2.3) 

P N {X t 4 D} = 2na 2 [' -^ siny V(2n+1) cxp ( - ^ + ^^ \ P n (cos<p)d<p, 
j n 4ira 2 ^-^ V a I 

J " nGN v / 

or 

P N {X t {D}= l -+ l - £ (2n+l)cxp f_ n ( n + 1 )^ i \ T P„(co S ^)sin(^)#. 

„CM* \ /Jo 

(5.7) 



raeN 

However for every n£N 



1=1 P n (cos(p)sm(ip)dif = / P n (x)dx. 
Jo Jo 

It is known that (see [7]) 

Pn(x) = ^j [Pn+l(x) - P n -l(x)} . 

Thus 

I = W^~T (Pn+lW - Pn-lW - iWl(O) + Pn-l(0)) , 

2ra + 1 
or 

/=^ T (P„-i(0)-P„ +1 (0)). 



It is also known that for every n G N* 



P 2n(0) = (-1)»J^ and P 2n+1 (0)=0. 

Thus, if n is even then 7 = 0. 
If n is odd, i.e. n = 2k + 1, then 

J=4J^(^ fc (0)-P 2(n+ i)(0)), 

(-l)"(2fc)!(2fc + 3) 

' ~ (4fc + 3)2 2fe + 1 /c! ' 

From (5.7) and (5.8) we obtain that 



(5.8) 



p»w^>=i + iD-i)-°-p(- (2 ° +i "r +i)i/i )- g ^ a - 

(5.9) 
Furthermore, if 5(0, 7r) namely the south pole of 5 2 , then 

P s {X t iD} = P N {X t iD} = P N {X t €£>} = !- P N {X t 4_ £>}. 



12 
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Therefore, 
P S {X ^ D} = 1_^ { _ IT ^( (2n + l)(2n + l)Vt\ (2n)!(2n + 3) 



2 2^ v ' *\ a J 2 2n + 1 n\ 

(5.10) 
Theorem 5.1 implies that, if T = inf {t > | X t £ D} , then 

pHt < t} . i- B-T-p p" + '^ + ^ y<j|gi±l>. ( , u) 

riGN ^ ' 

The case of S 13 

Let X t be the Brownian motion on a 3-dimcnsional sphere S* 3 of radius 
a starting at the point N(0, 0, 0) in spherical coordinates. If 

D={(d 1 ,6 2 ,L P )€S 3 \ 1 g[O,27r),fl 2 e[O,7r], V e (f,^]} 
then 

P N {X t (£ D}= J* J J p(t,ip)a 3 sin6> 2 sin 2 (ip)d6 1 de 2 d(p, 
Jo Jo Jo 

i.e. 

P^jXt ^ D} = 4ira 3 / ' p(t, <p) sm 2 (<p)d<p, 
Jo 

where p(t, ip) is the transition density function of the Brownian motion on 

S 3 of radius a. Hence from (2.6) 



P N {X t £D} = 4*a 3 Jj sin2 (^) 27r 2 a3 1 sin(y) E nsin ^) ex P 



2a 2 

n6N 



t(n 2 - 1) 



dtp, 



or 

,2 



P^lXt ^£>}= 2 Vncxpf- ^^ „ ^ / 2 sin(p) sin(np)dp. (5-12) 



nGN 

Let us call 

1=1 sin(<^) sin(n<^)d<p. 
Jo 

If n = 1, then I = j- If n > 1, then 

r ncos(^) 



ri 2 — 1 
Thus from (5.12), 



1 2^ 9 / £(n 2 -l)\ /nTT 

"- ■■- ' ' cos ' 



n=2 v 

However, cos (^ ) = for every n odd, hence 

Mr , ^ 1 2 v-^ 9 / tin 2 — 1)\ /mrx 



n even 



13 
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or 

P»{X* *D} = \ ~ ~ E (-D"» 9 «P l-^^ 1 ) ^ 

71 ti6N« ^ ' ' ' 

Furthermore, if S — (0, 0, w) then, 

P s {X t iD} = P N {X t iD} = P N {X t e D} = 1 - P^X* £ £>}. 
Therefore, 

P*{X t £D} = \ + I £ (-l)"- 2 oxp (-^^) • (5-14) 

Theorem 5.1 implies that, if T = inf {t > | X t £ D} , then 

^<*} = l + ^E(-l)Vex P (-^^Y (^5) 
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Abstract 

General constitutive relationships of the linear theory of viscoelas- 
ticty , which contain fractional derivatives, are considered. It is proved 
how these constitutive relationships can be written in the form of an 
integral dependence between stress and deformation. 
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Let us consider a uniaxial stressed state, then the dependence between 
stress and deformation is written in the form (see [1]) 

(1 + Ef=i^)(r + YZ=ibkDS)(r(t) = 

= [(l + Ep=i^)(Ao + E/=iA^Mt)+ (1) 

+2[(1 + Ef=i a k D^)(n + Ef =1 HiD a <)e(t). 

Here D a denotes a fractional derivative in the Riemann-Liouville sense. 
We remind that a fractional integral in the Riemann-Liouville sense is 
defined by the formula 

I a f(t) = -L / (x - rr- l f\r)dr, t > 0, a G R + , 
r(a) Jo 

and a fractional derivative by the formula 

1 /" f(r)dr 



7 

D a f(t) ' 



dt r 



T(m - a) Jo (t-r 



vQ+l- m 



m — 1 < a < m, 



317 



SURGULATZE: VISCOELASTICITY WITH FRACTIONAL DERIVATIVES 



D a f(t) = —,a = m. 
JU dt m 

Let at the initial moment of time 

e (0) = cr(O) = 0. (2) 

Also assume that the indexes of fractional derivatives satisfy the relations 



Pk > Pk-i > ■ ■ -Pi >= 0, 
&> >&>-!>•; -A >=0, 

Xj > Aj_i • • • Ai > 0, 
ai > «i_i > • • ■«! > 0. 

Let the following inequalities be fulfilled 



(3) 



P k + P p ^l,k = l,K-p=l,P; 



/3 p + \ j ^l,p=l,P;j = l,J; (4) 



Pk + a^ l,k = l,K;l = l,L. 

Note that condition (4) is fulfilled in all practical cases. 
The following Theorem is valid: 

Theorem: Let in the case of uniaxial stress, constitutive relationships 
for uniform isotropic materials be written in form (1). If conditions (3) and 
(4) are fulfilled, then relation (1) is equivalent to the relation 

e — n * da, (5) 

where the function U is expressed as a series of Mittag-Leffter type functions. 
Proof: As is known ([1]), all [3k,Xj,ai and (3 p < 1, therefore, as shown 
in [2], for the fractional derivatives in the left- and right-hand part of (1) we 
can apply the composition rule, i.e. it can be assumed that the following 
equalities hold: 

D$hDi*p = d^+Pv, k = \7K\v = M 5 ; 

D P pD \j = D^+' x i,p = LP; j = 177; 
£>&£>«« = £>&+*« , k = lj<; I = LX. 

From (2) and (3) it follows that for the Laplace transform of fractional 
derivatives the following formula is valid 

n— 1 

£{Dif} = s«£{f} - Y, ^- 1_fe /(0); n - 1< q < n. (6) 

k=o 
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In [3] it is shown that the following two conditions are equivalent: 

DP-'f(a) = 0, j = l,2,---,m; m - 1 < p < m; (7) 

f(a) = 0, j = 0,l,2,...,m-l. (8) 

Taking into account conditions (2) and (4) and the equivalence of condi- 
tions (7) and (8), after the Laplace transform of (1) we obtain 



i+ z;=i ftp** ^o + z;=i x jS 



+2 



-K 



1 + E fe =i a k s Pk )[IM> + Ej=i ^ sa 



e+ 



0) 



where a and e is the Laplace transform of stress and deformation, respec- 
tively. 

After multiplying and combining the identical terms, we can rewrite (9) 
in the form 



M 



N 



J2 c m s 7rn a = J2 d nS Un e, 



m=0 



ra=0 



where 



1m > 7m-i > • • • > 71 > 7o = 0; 

vn > vn-i > ■ ■ • > v\ > vq — 0; 

1m = Pk + ftp, v N = max{/3 P + Xj, p K + a L }; 

7o = v = 0; c = 1; d = A + 2/i . 



(10) 



;n) 



c m ,m = 1, M are explicitly expressed through Ofe, k = 0,K and 6 p ,p = 0, P; 
and d n ,n — 1,N are explicitly expressed through Ofe, /c = 0,K;b p ,p = 
0~P; Xjj = 0~J //,, / = 071. 

As shown in [4], for relation (1) to have a physical meaning, it is necessary 
that the following inequality be fulfilled: 



Vn > 7m- 



From (9) we obtain 



M 
m=0 



S 



lm 



N 

E d nS V 
ra=0 



-a. 



12) 



(13) 
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Since the inverse Laplace transform is linear, to obtain the interdepen- 
dence of a(t) and e(t) , it suffices to consider one summand in the right-hand 
part of (13). The reasoning for other summands will be analogous. Let this 
summand have the form 

-a, (14) 



N 

E dn-S l 
n=0 



where ^ r ^ M, r is fixed. 

Denote by e lr the deformation defined by (14). It is obvious that e can 
be represented as a sum of summands 



£ 7r — C r N 



-a. 



(15) 



E d nS^ 



n=0 



The validity of the following formula is proved in [3] 



oo 



iE(-i) m E E (m;k ,h, 

k + h + ■ ■ ■ + K-2 = m 
k ^ 0, • • -,k n _ 2 > 



k„- 2 )>< 



m=0 



n-2 , 

x n f- 



n-2 
•s i=0 



an 



By the latter formula we obtain 



£ 



C 



7r ^ r d Jv s' / iV+d JV _ lS I 'JV-l + ... + ( i lS n+d oS ''o 



(7 



oo 

&£(-i) r 

m=0 



E 

&o + h + ■ ■ ■ + k N _ 2 



(m;ko,h,- ■ -,k N - 2 ) x 



/;/ 



(16) 



fcn ^ 0, • • •, k 



•,KN-2 



N-2 , 

x n Gfe 

j=0 x 



JV-2 
77— "iV-l + E { v 2~ v N-\) k i 

S J-° 

S "N- V N-1+- 



> o 



a. 



We remind that the Mittag-Leffler function is the function 



E a (z) = J2 



k=0 

4 



r(ak + l)' 
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where T(z) is Euler's function. The Mittag-Leffler function is an entire func- 
tion of order p — - and type 1. If a is a complex number, then the order 
is p = -^.The generalization of this function is the two-parameter Mittag- 
Lefner function 



OO J, 



^w = £f(^T^ a,(3>0 - 



C {t^-^l (±at")} = J'* , Res > \a\^. (17) 



The following formula is valid (see [3]) 

k\s a ~ f! 
(s a T a) 

With (17) taken into account, from (16) we obtain 

e lr = n 7r * a, (18) 

where * is the convolution symbol and 

IV = £ E™=o(-l) m E (jn; k 0) k u ■ • •, k N . 2 ) x 

k + ki + ■ ■ ■ + k N - 2 = m 
k ^ 0, • • •, k N _ 2 > 



*nS 2 (*)< 



JV-2 

m{v N -v N _{)+v N - J2 (vj-vn -i)kj-l-7r 

j= o x 



X£ jv_ 2 [_%i^iV-^-l 



It is not difficult to verify that the primitive function 

JV-2 
m(^jv-^Ar_l)+^jV- J] (vj-VN-^kj-l-Jr 

E™(t)=t -° x 

X£ JV-2 ^_ ^=1^-^-1 

j=0 

is the function 

JV-2 

m(v N -v N -i)+v N - Y (vj-VN-i)kj-yr 

E™(t) = t -° x 

X£ JV-2 (_%=I^JV-^ 



"n-vn-i,vn- Y (^ 3 -^jv-i)fej-7r+l v 

3=0 



(19) 
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which satisfies the condition 

K(o) = o. 

Indeed, using the formula 

^ 1 



we obtain 



C{ J f(t)dt^ = -C{f(t)}, 



JV-2 
1 _ , , S i=° 



£{J o K(t)*t} = ]c{K(t)} 



, ^ d N ) 

Hence, applying (17), we obtain 



m+l 



E™ r {t) = j t El{r)dT = C- 1 



JV-2 
7r-"JV-l+ S ("j— "iV-l) fc j _1 
S ^ 

j _„ ~ m+l 



JV-2 

m(v Ar -^iv_i)+^Ar- £ (vj-VN-l)kj—y r 
t '=° X 



-C/ JV-2 

vn-vn-i,vn- J2 0j-^JV-l)fcj-7r+l 



/ dN-i ^u N -u N _ 1 \ 

-l\ dN J ' 

3=0 " J 

To prove that 

K(o) = o, 

we use the limit theorem of the Laplace transform 



lim/(t)= lim s£{f(t)}. 

t^O s—>oo 



Thus we obtain 



JV-2 
~/r-VN-l+ J2 (^-^JV-l)fej-l 

lim E™(t) = lim — -=-. 

' g^JV-^JV-l -|_ " Af - 1 

From (11) and (12) the validity of the following inequalities follows 
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N-2 

^2(Vj - VN-l)kj ^ 

3=0 

and 

V N - Z/at_i > 7 r - VN-1- 

These inequalities imply that 

By virtue of the above-said and the condition c(0) = 0, after performing 
the operation of integration by parts we can write (18) in the form 

e lr = n 7r * da, (20) 

where il 7r is different from II 7r because in the sum the functions -EI^(t) are 
replaced by the functions E™(t). 

Summing (20) over r = \M , we ascertain that our theorem is true. 

Corollary: // conditions (3) and (4) are fulfilled, then relation (1) can 
be written in the form 

a = R*de, (21) 

where the function R is expressed as a series of Mittag-Leffler type functions. 
Proof:Indeed, as is known from [4] and [5], the following equality is 
fulfilled 

£{R}£{fl} = 1 (22) 

On the other hand, 



Hence 



But, as is known from [4], 7m + 1 ^ vjq. Now to prove the validity of (21), 
it suffices to apply the same reasoning as that we have used for the function 

n. 



£{n} = 


c 

s 




N 


i 


n=0 


sC{U} 


M 

E c m s^ +1 

m=0 



7 
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